Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
7101 | fi.wikipedia.org | 8304 | 5.16 | 200 | HTML 5, No Lang |
7102 | esrb.org | 8305 | 5.16 | 200 | HTML 5, English |
7103 | wyzowl.com | 8306 | 5.16 | 200 | HTML 5, English |
7104 | dailywire.com | 8307 | 5.16 | 200 | HTML 5, No Lang |
7105 | warbyparker.com | 8308 | 5.16 | 200 | HTML 5, English |
7106 | tvnz.co.nz | 8309 | 5.16 | 200 | HTML 5, No Lang |
7107 | catalog.hathitrust.org | 8311 | 5.16 | 200 | HTML 5, English |
7108 | commondreams.org | 8312 | 5.16 | 200 | HTML 5, English |
7109 | georgiaaquarium.org | 8313 | 5.16 | 200 | HTML 5, English |
7110 | classy.org | 8314 | 5.16 | 200 | HTML 5, English |
7111 | clickatell.com | 8315 | 5.16 | 200 | HTML 5, English |
7112 | winners.webbyawards.com | 8316 | 5.16 | 200 | HTML 5, English |
7113 | linksys.com | 8317 | 5.16 | 200 | HTML 5, English |
7114 | dealbook.nytimes.com | 8318 | 5.16 | 200 | HTML 5, English |
7115 | frankfurt.de | 8320 | 5.16 | 200 | HTML 5 |
7116 | uni-saarland.de | 8322 | 5.16 | 200 | HTML 5 |
7117 | pdfhost.io | 8323 | 5.16 | 200 | HTML 5, English |
7118 | opensource.apple.com | 8324 | 5.16 | 200 | HTML 5, English |
7119 | bepress.com | 8326 | 5.16 | 200 | HTML 5, English |
7120 | simplywhisked.com | 8327 | 5.16 | 200 | HTML 5, English |
7121 | bundestag.de | 8329 | 5.16 | 200 | HTML 5 |
7122 | coolsymbol.com | 8330 | 5.16 | 200 | HTML 5, English |
7123 | forestapp.cc | 8331 | 5.16 | 200 | HTML 5, English |
7124 | ourcommons.ca | 8332 | 5.16 | 200 | HTML 5, English |
7125 | auto-motor-und-sport.de | 8333 | 5.16 | 200 | HTML 5 |
7126 | snyk.io | 8334 | 5.16 | 200 | HTML 5, English |
7127 | dl.google.com | 8335 | 5.16 | 200 | HTML 5, English |
7128 | svn.apache.org | 8336 | 5.16 | 200 | No Lang |
7129 | latex-project.org | 8337 | 5.16 | 200 | HTML 5, English |
7130 | baeldung.com | 8338 | 5.16 | 200 | HTML 5, English |
7131 | maib.md | 8339 | 5.16 | 200 | HTML 5 |
7132 | reacttraining.com | 8340 | 5.16 | 200 | HTML 5, English |
7133 | rambler.ru | 8341 | 5.16 | 200 | HTML 5 |
7134 | promptbase.com | 8342 | 5.16 | 200 | HTML 5, English |
7135 | arizona.edu | 8343 | 5.16 | 200 | HTML 5, English |
7136 | fr.calameo.com | 8345 | 5.16 | 200 | HTML 5 |
7137 | france.tv | 8346 | 5.16 | 200 | HTML 5 |
7138 | ilm.com | 8347 | 5.16 | 200 | HTML 5, English |
7139 | hometalk.com | 8348 | 5.16 | 200 | HTML 5, English |
7140 | goop.com | 8350 | 5.16 | 200 | HTML 5, English |
7141 | revisor.mn.gov | 8351 | 5.16 | 200 | HTML 5, English |
7142 | ziare.com | 8352 | 5.16 | 200 | HTML 5 |
7143 | vi.wikipedia.org | 8353 | 5.16 | 200 | HTML 5, No Lang |
7144 | blogs.skype.com | 8354 | 5.16 | 200 | HTML 5, English |
7145 | kyivpost.com | 8355 | 5.16 | 200 | HTML 5, No Lang |
7146 | onionshare.org | 8356 | 5.16 | 200 | HTML 5, English |
7147 | cuhk.edu.hk | 8357 | 5.16 | 200 | No Lang |
7148 | bpost.be | 8359 | 5.16 | 200 | HTML 5, English |
7149 | fullcontact.com | 8360 | 5.16 | 200 | HTML 5, English |
7150 | data.un.org | 8361 | 5.16 | 200 | No Lang, Transitional |
7151 | listal.com | 8362 | 5.16 | 200 | HTML 5, English |
7152 | remodelista.com | 8363 | 5.16 | 200 | HTML 5, English |
7153 | nfb.org | 8364 | 5.16 | 200 | HTML 5, English |
7154 | netbeans.org | 8365 | 5.16 | 200 | HTML 5, English |
7155 | abcmouse.com | 8366 | 5.16 | 200 | HTML 5, English |
7156 | unix.stackexchange.com | 8367 | 5.16 | 200 | HTML 5, English |
7157 | cftc.gov | 8368 | 5.16 | 200 | HTML 5, English |
7158 | hotstar.com | 8370 | 5.16 | 200 | HTML 5, English |
7159 | leagueoflegends.com | 8371 | 5.16 | 200 | HTML 5, English |
7160 | mediapart.fr | 8372 | 5.16 | 200 | HTML 5 |
7161 | fastwork.co | 8374 | 5.16 | 200 | HTML 5 |
7162 | 3playmedia.com | 8375 | 5.16 | 200 | HTML 5, English |
7163 | languagelog.ldc.upenn.edu | 8376 | 5.16 | 200 | No Lang, Transitional |
7164 | vodafone.com.au | 8377 | 5.16 | 200 | HTML 5, English |
7165 | endnote.com | 8378 | 5.16 | 200 | HTML 5, English |
7166 | monoskop.org | 8379 | 5.16 | 200 | HTML 5, English |
7167 | bcu.ac.uk | 8380 | 5.16 | 200 | HTML 5, English |
7168 | writersdigest.com | 8382 | 5.16 | 200 | HTML 5, English |
7169 | admob.google.com | 8383 | 5.16 | 200 | HTML 5, English |
7170 | mailpoet.com | 8384 | 5.16 | 200 | HTML 5, English |
7171 | io9.gizmodo.com | 8385 | 5.16 | 200 | HTML 5, English |
7172 | music.line.me | 8386 | 5.16 | 200 | HTML 5 |
7173 | sandiegozoo.org | 8387 | 5.16 | 200 | HTML 5, English |
7174 | news.yale.edu | 8388 | 5.16 | 200 | HTML 5, English |
7175 | newindianexpress.com | 8390 | 5.16 | 200 | HTML 5, English |
7176 | courierpress.com | 8391 | 5.16 | 200 | HTML 5, English |
7177 | theleanstartup.com | 8393 | 5.16 | 200 | No Lang, Strict |
7178 | ideas.lego.com | 8394 | 5.16 | 200 | HTML 5, English |
7179 | umt.edu | 8395 | 5.16 | 200 | HTML 5, English |
7180 | namu.wiki | 8396 | 5.16 | 200 | HTML 5 |
7181 | globaldata.com | 8397 | 5.16 | 200 | HTML 5, English |
7182 | liberty.edu | 8398 | 5.16 | 200 | HTML 5, English |
7183 | tennessean.com | 8399 | 5.16 | 200 | HTML 5, English |
7184 | jdsports.com | 8400 | 5.16 | 200 | No Lang |
7185 | waterkeeper.org | 8401 | 5.16 | 200 | HTML 5, English |
7186 | ttb.gov | 8402 | 5.16 | 200 | HTML 5, English |
7187 | extensions.joomla.org | 8403 | 5.16 | 200 | HTML 5, English |
7188 | revistas.unal.edu.co | 8404 | 5.16 | 200 | HTML 5 |
7189 | moz.de | 8405 | 5.16 | 200 | HTML 5 |
7190 | iledefrance.fr | 8406 | 5.16 | 200 | HTML 5 |
7191 | tvseriesfinale.com | 8407 | 5.16 | 200 | HTML 5, English |
7192 | moddb.com | 8408 | 5.16 | 200 | HTML 5, English |
7193 | opensource.googleblog.com | 8410 | 5.16 | 200 | HTML 5, No Lang |
7194 | gia.edu | 8411 | 5.16 | 200 | HTML 5, English |
7195 | 10up.com | 8412 | 5.16 | 200 | HTML 5, English |
7196 | design-milk.com | 8413 | 5.16 | 200 | HTML 5, English |
7197 | open.canada.ca | 8414 | 5.16 | 200 | HTML 5, English |
7198 | boosty.to | 8415 | 5.16 | 200 | HTML 5 |
7199 | manuals.info.apple.com | 8416 | 5.16 | 200 | HTML 5, English |
7200 | walkscore.com | 8417 | 5.16 | 200 | HTML 5, No Lang |
Data from: Open PageRank