Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but outside the top 10M?
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on the use of HTML5, valid HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML5 doctype
- The domains in the list are naked, with no hostname such as www.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without disabling SSL verification
- With standard security checks, clients will never reach the redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
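The doctype and lang checks described above can be sketched roughly as follows. This is a minimal illustration, not the crawler behind the table. The function names (`doctype_flag`, `lang_flag`, `page_flags`, `candidate_urls`) are hypothetical. The flag strings and their order imitate the Flags column in the table that follows. Treating a non-English lang value as "no flag" is an assumption inferred from rows that show only "HTML 5".

```python
import re

# Regex-based checks on the raw home page markup. A real crawler would
# probably use an HTML parser; regexes keep this sketch dependency-free.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# Matches a bare lang attribute, but not xml:lang or data-lang.
LANG_RE = re.compile(r"""(?:^|\s)lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def doctype_flag(html):
    """Return 'HTML 5', 'Strict', 'Transitional', or None if unrecognized."""
    match = DOCTYPE_RE.search(html)
    if match is None:
        return None
    body = match.group(1).strip().lower()
    if body == "html":
        return "HTML 5"
    if "strict" in body:
        return "Strict"
    if "transitional" in body:
        return "Transitional"
    return None


def lang_flag(html):
    """Return 'No Lang', 'English', or None for any other language (assumed)."""
    tag = HTML_TAG_RE.search(html)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        return "No Lang"
    return "English" if lang.group(1).lower().startswith("en") else None


def page_flags(html):
    """Join the flags in the order the table appears to use."""
    doctype = doctype_flag(html)
    flags = []
    if doctype == "HTML 5":
        flags.append(doctype)
    flags.append(lang_flag(html))
    if doctype != "HTML 5":
        flags.append(doctype)
    return ", ".join(flag for flag in flags if flag)


def candidate_urls(domain):
    """List the naked and www URLs to try, since the list holds naked domains."""
    hosts = [domain] if domain.startswith("www.") else [domain, "www." + domain]
    return ["https://" + host + "/" for host in hosts]
```

For example, `page_flags('<!doctype html><html><body></body></html>')` yields `HTML 5, No Lang`. Fetching each URL from `candidate_urls` with certificate verification left on would then surface the SSL errors noted above as connection failures rather than redirects.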
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
14001 | ricoh.com | 16314 | 4.95 | 200 | HTML 5, English |
14002 | neweracap.com | 16315 | 4.95 | 200 | HTML 5, No Lang |
14003 | pypl.github.io | 16317 | 4.95 | 200 | HTML 5, English |
14004 | ricksteves.com | 16318 | 4.95 | 200 | HTML 5, No Lang |
14005 | simpleanalytics.com | 16320 | 4.95 | 200 | HTML 5, English |
14006 | thenewhumanitarian.org | 16321 | 4.95 | 200 | HTML 5, English |
14007 | 11alive.com | 16323 | 4.95 | 200 | HTML 5, English |
14008 | mycharitywater.org | 16324 | 4.95 | 200 | HTML 5, English |
14009 | alice.org | 16325 | 4.95 | 200 | HTML 5, English |
14010 | privacyrights.org | 16327 | 4.95 | 200 | HTML 5, English |
14011 | templatelab.com | 16328 | 4.95 | 200 | HTML 5, English |
14012 | press.umich.edu | 16329 | 4.95 | 200 | HTML 5, English |
14013 | caml.inria.fr | 16330 | 4.95 | 200 | English, Strict |
14014 | guitarworld.com | 16331 | 4.95 | 200 | HTML 5, English |
14015 | qgis.org | 16332 | 4.95 | 200 | HTML 5, English |
14016 | corestandards.org | 16333 | 4.95 | 200 | HTML 5, English |
14017 | makeawebsitehub.com | 16334 | 4.95 | 200 | HTML 5, English |
14018 | climatechangenews.com | 16335 | 4.95 | 200 | HTML 5, English |
14019 | theimpulsivebuy.com | 16336 | 4.95 | 200 | HTML 5, English |
14020 | anses.fr | 16337 | 4.95 | 200 | HTML 5 |
14021 | eurasianet.org | 16338 | 4.95 | 200 | HTML 5, English |
14022 | docs.geoserver.org | 16339 | 4.95 | 200 | English, Transitional |
14023 | jshint.com | 16341 | 4.95 | 200 | HTML 5, English |
14024 | fourcc.org | 16343 | 4.95 | 200 | No Lang, Transitional |
14025 | irishnews.com | 16345 | 4.95 | 200 | HTML 5, English |
14026 | tuwien.ac.at | 16346 | 4.95 | 200 | HTML 5 |
14027 | misfitsmarket.com | 16348 | 4.95 | 200 | HTML 5, English |
14028 | cadena3.com | 16349 | 4.95 | 200 | HTML 5, English |
14029 | cpuid.com | 16350 | 4.95 | 200 | HTML 5, English |
14030 | prisonpolicy.org | 16351 | 4.95 | 200 | HTML 5, English |
14031 | kob.com | 16352 | 4.95 | 200 | HTML 5, English |
14032 | grdf.fr | 16353 | 4.95 | 200 | HTML 5 |
14033 | sg.theasianparent.com | 16354 | 4.95 | 200 | HTML 5, English |
14034 | magicleap.com | 16357 | 4.95 | 200 | HTML 5, English |
14035 | maersk.com | 16358 | 4.95 | 200 | HTML 5, English |
14036 | moncompteformation.gouv.fr | 16359 | 4.95 | 200 | HTML 5 |
14037 | mithril.js.org | 16360 | 4.95 | 200 | HTML 5, English |
14038 | houzz.co.uk | 16361 | 4.95 | 200 | HTML 5, English |
14039 | io.google | 16362 | 4.95 | 200 | HTML 5, English |
14040 | mbc.net | 16364 | 4.95 | 200 | HTML 5, English |
14041 | news-journalonline.com | 16366 | 4.95 | 200 | HTML 5, English |
14042 | lvmh.com | 16367 | 4.95 | 200 | HTML 5, No Lang |
14043 | stampinup.com | 16368 | 4.95 | 200 | HTML 5, English |
14044 | iris.uniroma1.it | 16369 | 4.95 | 200 | HTML 5 |
14045 | rouleur.cc | 16370 | 4.95 | 200 | HTML 5, English |
14046 | cambridge-news.co.uk | 16371 | 4.95 | 200 | HTML 5, English |
14047 | telecomtv.com | 16372 | 4.95 | 200 | HTML 5, English |
14048 | logotv.com | 16373 | 4.95 | 200 | HTML 5, English |
14049 | volkswagen.de | 16376 | 4.95 | 200 | HTML 5 |
14050 | lipsum.com | 16377 | 4.95 | 200 | HTML 5, English |
14051 | cis.minsk.by | 16378 | 4.95 | 200 | HTML 5 |
14052 | pedestrian.tv | 16380 | 4.95 | 200 | HTML 5, English |
14053 | worrydream.com | 16381 | 4.95 | 200 | HTML 5, English |
14054 | heritage-history.com | 16382 | 4.95 | 200 | HTML 5, No Lang |
14055 | croatiaairlines.com | 16384 | 4.95 | 200 | HTML 5, English |
14056 | orange.md | 16385 | 4.95 | 200 | HTML 5 |
14057 | stbaldricks.org | 16386 | 4.95 | 200 | HTML 5, English |
14058 | help.qlik.com | 16387 | 4.95 | 200 | HTML 5, English |
14059 | bitcoinist.com | 16388 | 4.95 | 200 | HTML 5 |
14060 | makery.info | 16390 | 4.95 | 200 | |
14061 | dlmf.nist.gov | 16392 | 4.95 | 200 | English, Strict |
14062 | um.es | 16394 | 4.95 | 200 | HTML 5 |
14063 | lavuelta.com | 16396 | 4.95 | 200 | HTML 5, English |
14064 | healthy.kaiserpermanente.org | 16398 | 4.95 | 200 | HTML 5, English |
14065 | servustv.com | 16399 | 4.95 | 200 | HTML 5 |
14066 | nntp.perl.org | 16400 | 4.95 | 200 | No Lang, Transitional |
14067 | picryl.com | 16401 | 4.95 | 200 | HTML 5, English |
14068 | open.umn.edu | 16403 | 4.95 | 200 | HTML 5, English |
14069 | us.etrade.com | 16404 | 4.95 | 200 | HTML 5, English |
14070 | tether.to | 16405 | 4.95 | 200 | HTML 5, No Lang |
14071 | journals.tdl.org | 16406 | 4.95 | 200 | HTML 5, English |
14072 | podlove.org | 16407 | 4.95 | 200 | HTML 5, English |
14073 | good.is | 16408 | 4.95 | 200 | HTML 5, English |
14074 | journalstar.com | 16410 | 4.95 | 200 | HTML 5, English |
14075 | xubuntu.org | 16411 | 4.95 | 200 | HTML 5, English |
14076 | herroom.com | 16412 | 4.95 | 200 | HTML 5, English |
14077 | blog.golang.org | 16413 | 4.95 | 200 | HTML 5, English |
14078 | dosomething.org | 16414 | 4.95 | 200 | HTML 5, English |
14079 | restoreprivacy.com | 16415 | 4.95 | 200 | HTML 5, English |
14080 | santafenewmexican.com | 16417 | 4.95 | 200 | HTML 5, English |
14081 | bair.berkeley.edu | 16418 | 4.95 | 200 | HTML 5, English |
14082 | dalailama.com | 16419 | 4.95 | 200 | HTML 5, English |
14083 | ultimateears.com | 16420 | 4.95 | 200 | HTML 5, English |
14084 | contentsquare.com | 16422 | 4.95 | 200 | HTML 5, English |
14085 | protege.stanford.edu | 16423 | 4.95 | 200 | HTML 5, English |
14086 | banuba.com | 16424 | 4.95 | 200 | HTML 5, English |
14087 | radionz.co.nz | 16425 | 4.95 | 200 | HTML 5, English |
14088 | thetimezoneconverter.com | 16426 | 4.95 | 200 | HTML 5, No Lang |
14089 | businessofhome.com | 16427 | 4.95 | 200 | HTML 5, No Lang |
14090 | aifa.gov.it | 16428 | 4.95 | 200 | HTML 5 |
14091 | nti.org | 16429 | 4.95 | 200 | HTML 5, English |
14092 | silverlake.com | 16430 | 4.95 | 200 | HTML 5, English |
14093 | alleninstitute.org | 16431 | 4.95 | 200 | HTML 5, English |
14094 | visitguernsey.com | 16432 | 4.95 | 200 | HTML 5, No Lang |
14095 | webmasterworld.com | 16433 | 4.95 | 200 | HTML 5, English |
14096 | kwikset.com | 16434 | 4.95 | 200 | HTML 5, English |
14097 | collections.mfa.org | 16435 | 4.95 | 200 | HTML 5, English |
14098 | openarchives.org | 16436 | 4.95 | 200 | English, Strict |
14099 | science.gov | 16437 | 4.95 | 200 | HTML 5, English |
14100 | freedcamp.com | 16438 | 4.95 | 200 | HTML 5, English |
Data from: Open PageRank