Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
11201 | html2canvas.hertzen.com | 13048 | 5.01 | 200 | HTML 5, No Lang |
11202 | penzu.com | 13049 | 5.01 | 200 | HTML 5, English |
11203 | shop.mattel.com | 13050 | 5.01 | 200 | HTML 5, English |
11204 | timeshighereducation.co.uk | 13051 | 5.01 | 200 | HTML 5, English |
11205 | themegrill.com | 13053 | 5.01 | 200 | HTML 5, English |
11206 | delltechnologies.com | 13054 | 5.01 | 200 | HTML 5, English |
11207 | musicglue.com | 13055 | 5.01 | 200 | HTML 5, English |
11208 | informaconnect.com | 13056 | 5.01 | 200 | HTML 5, English |
11209 | sites.ed.gov | 13057 | 5.01 | 200 | HTML 5, English |
11210 | thatgamecompany.com | 13058 | 5.01 | 200 | HTML 5, English |
11211 | follow.it | 13061 | 5.01 | 200 | HTML 5, English |
11212 | altmetric.com | 13063 | 5.01 | 200 | HTML 5, English |
11213 | carrismetropolitana.pt | 13064 | 5.01 | 200 | HTML 5, English |
11214 | mapion.co.jp | 13066 | 5.01 | 200 | HTML 5 |
11215 | wassenaar.org | 13067 | 5.01 | 200 | HTML 5, English |
11216 | serve.com | 13068 | 5.01 | 200 | HTML 5, English |
11217 | globalwellnessinstitute.org | 13069 | 5.01 | 200 | HTML 5, English |
11218 | delcampe.net | 13070 | 5.01 | 200 | HTML 5, English |
11219 | swr3.de | 13071 | 5.01 | 200 | HTML 5 |
11220 | ccmixter.org | 13072 | 5.01 | 200 | English, Strict |
11221 | diaart.org | 13073 | 5.01 | 200 | HTML 5, English |
11222 | womentechmakers.com | 13074 | 5.01 | 200 | HTML 5, English |
11223 | memri.org | 13075 | 5.01 | 200 | English |
11224 | corrieredellosport.it | 13076 | 5.01 | 200 | HTML 5 |
11225 | bia.gov | 13077 | 5.01 | 200 | HTML 5, English |
11226 | lecremedelacrumb.com | 13078 | 5.01 | 200 | HTML 5, English |
11227 | ventureharbour.com | 13079 | 5.01 | 200 | HTML 5, English |
11228 | interiordesign.net | 13081 | 5.01 | 200 | HTML 5, English |
11229 | vodafone.com.eg | 13082 | 5.01 | 200 | HTML 5, English |
11230 | markets.ft.com | 13083 | 5.01 | 200 | HTML 5, English |
11231 | rockhall.com | 13084 | 5.01 | 200 | HTML 5, English |
11232 | paulekman.com | 13085 | 5.01 | 200 | HTML 5, English |
11233 | hyperledger.org | 13086 | 5.01 | 200 | HTML 5, English |
11234 | newscenter.lbl.gov | 13087 | 5.01 | 200 | HTML 5, English |
11235 | thereformation.com | 13088 | 5.01 | 200 | HTML 5, English |
11236 | eventscribe.com | 13089 | 5.01 | 200 | HTML 5, English |
11237 | kshb.com | 13091 | 5.01 | 200 | HTML 5, English |
11238 | viewbug.com | 13092 | 5.01 | 200 | HTML 5, English |
11239 | math.ucsd.edu | 13093 | 5.01 | 200 | HTML 5, English |
11240 | turnkeylinux.org | 13094 | 5.01 | 200 | English |
11241 | oebb.at | 13095 | 5.01 | 200 | HTML 5 |
11242 | freakytrigger.co.uk | 13096 | 5.01 | 200 | No Lang, Transitional |
11243 | nal.usda.gov | 13097 | 5.01 | 200 | HTML 5, English |
11244 | tuprints.ulb.tu-darmstadt.de | 13098 | 5.01 | 200 | No Lang, Transitional |
11245 | people.maths.ox.ac.uk | 13099 | 5.01 | 200 | HTML 5, English |
11246 | gutenberg.net.au | 13100 | 5.01 | 200 | No Lang, Transitional |
11247 | americasbestpics.com | 13101 | 5.01 | 200 | HTML 5, English |
11248 | reportlinker.com | 13102 | 5.01 | 200 | HTML 5, English |
11249 | adpushup.com | 13103 | 5.01 | 200 | HTML 5, English |
11250 | square-enix-games.com | 13104 | 5.01 | 200 | HTML 5, English |
11251 | citymapper.com | 13105 | 5.01 | 200 | HTML 5, No Lang |
11252 | thejc.com | 13107 | 5.01 | 200 | HTML 5, English |
11253 | hurriyet.com.tr | 13108 | 5.01 | 200 | HTML 5 |
11254 | dataverse.harvard.edu | 13109 | 5.01 | 200 | English |
11255 | powerapps.microsoft.com | 13110 | 5.01 | 200 | HTML 5, English |
11256 | keloland.com | 13112 | 5.01 | 200 | HTML 5, English |
11257 | mid.ru | 13113 | 5.01 | 200 | HTML 5, No Lang |
11258 | dictionary.goo.ne.jp | 13114 | 5.01 | 200 | HTML 5 |
11259 | liquipedia.net | 13115 | 5.01 | 200 | HTML 5, English |
11260 | windtre.it | 13116 | 5.01 | 200 | HTML 5 |
11261 | admin.typeform.com | 13117 | 5.01 | 200 | No Lang |
11262 | forum.wordreference.com | 13118 | 5.01 | 200 | HTML 5, English |
11263 | kopashopping.com | 13119 | 5.01 | 200 | HTML 5, English |
11264 | sgvtribune.com | 13120 | 5.01 | 200 | HTML 5, English |
11265 | uxpin.com | 13121 | 5.01 | 200 | HTML 5, English |
11266 | ohio.gov | 13122 | 5.01 | 200 | HTML 5, English |
11267 | nyse.com | 13123 | 5.01 | 200 | HTML 5, English |
11268 | hiro.so | 13124 | 5.01 | 200 | HTML 5, English |
11269 | fenrir-inc.com | 13125 | 5.01 | 200 | HTML 5, No Lang |
11270 | owler.com | 13126 | 5.01 | 200 | HTML 5, English |
11271 | clevelandfed.org | 13128 | 5.01 | 200 | HTML 5, English |
11272 | medel.com | 13129 | 5.01 | 200 | HTML 5, English |
11273 | comic-con.org | 13131 | 5.01 | 200 | HTML 5, English |
11274 | google.com.sa | 13133 | 5.01 | 200 | HTML 5, English |
11275 | new.qq.com | 13134 | 5.01 | 200 | HTML 5 |
11276 | fluke.com | 13135 | 5.01 | 200 | HTML 5, English |
11277 | architectureartdesigns.com | 13136 | 5.01 | 200 | HTML 5, English |
11278 | ica.gov.sg | 13137 | 5.01 | 200 | HTML 5, English |
11279 | madebymike.com.au | 13138 | 5.01 | 200 | HTML 5, English |
11280 | brynmawr.edu | 13139 | 5.01 | 200 | HTML 5, English |
11281 | web.musc.edu | 13140 | 5.01 | 200 | HTML 5, No Lang |
11282 | reactome.org | 13142 | 5.01 | 200 | HTML 5, English |
11283 | jeuneafrique.com | 13144 | 5.01 | 200 | HTML 5 |
11284 | blog.tensorflow.org | 13145 | 5.01 | 200 | HTML 5, English |
11285 | foodiewithfamily.com | 13146 | 5.01 | 200 | HTML 5, English |
11286 | internacional.elpais.com | 13148 | 5.01 | 200 | HTML 5 |
11287 | jku.at | 13150 | 5.01 | 200 | HTML 5 |
11288 | indiapost.gov.in | 13151 | 5.01 | 200 | English |
11289 | ase.tufts.edu | 13152 | 5.01 | 200 | HTML 5, English |
11290 | dia.org | 13153 | 5.01 | 200 | HTML 5, English |
11291 | nhmrc.gov.au | 13154 | 5.01 | 200 | HTML 5, English |
11292 | pflanzmich.de | 13155 | 5.01 | 200 | HTML 5 |
11293 | djangogirls.org | 13156 | 5.01 | 200 | HTML 5, No Lang |
11294 | dokuwiki.org | 13157 | 5.01 | 200 | HTML 5, English |
11295 | journal-news.com | 13158 | 5.01 | 200 | HTML 5, English |
11296 | exame.com | 13159 | 5.01 | 200 | HTML 5 |
11297 | twitchtracker.com | 13160 | 5.01 | 200 | HTML 5, English |
11298 | renderosity.com | 13161 | 5.01 | 200 | HTML 5, English |
11299 | grinnell.edu | 13162 | 5.01 | 200 | HTML 5, English |
11300 | dph.georgia.gov | 13163 | 5.01 | 200 | HTML 5, English |
Data from: Open PageRank