Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
9701 | openbadges.org | 11307 | 5.05 | 200 | HTML 5, English |
9702 | latrobe.edu.au | 11308 | 5.05 | 200 | HTML 5, English |
9703 | smex.org | 11309 | 5.05 | 200 | HTML 5, No Lang |
9704 | krugman.blogs.nytimes.com | 11310 | 5.05 | 200 | HTML 5, English |
9705 | denver.org | 11311 | 5.05 | 200 | HTML 5, English |
9706 | osmosis.zone | 11312 | 5.05 | 200 | HTML 5, English |
9707 | artlebedev.ru | 11313 | 5.05 | 200 | |
9708 | reactos.org | 11314 | 5.05 | 200 | HTML 5, English |
9709 | frontier.com | 11316 | 5.05 | 200 | HTML 5, English |
9710 | hauteliving.com | 11317 | 5.05 | 200 | HTML 5, English |
9711 | userbase.kde.org | 11318 | 5.05 | 200 | HTML 5, No Lang |
9712 | devguide.python.org | 11319 | 5.05 | 200 | HTML 5, English |
9713 | naturalreaders.com | 11320 | 5.05 | 200 | HTML 5, English |
9714 | supreme.justia.com | 11321 | 5.05 | 200 | HTML 5, English |
9715 | idioms.thefreedictionary.com | 11322 | 5.05 | 200 | HTML 5, No Lang |
9716 | maps.yahoo.com | 11323 | 5.05 | 200 | HTML 5, English |
9717 | bluestacks.com | 11324 | 5.05 | 200 | HTML 5, English |
9718 | atlantafed.org | 11325 | 5.05 | 200 | HTML 5, English |
9719 | design.tutsplus.com | 11326 | 5.05 | 200 | HTML 5, English |
9720 | wisegeek.com | 11327 | 5.05 | 200 | HTML 5, English |
9721 | databank.worldbank.org | 11328 | 5.05 | 200 | HTML 5, English |
9722 | uni-konstanz.de | 11329 | 5.05 | 200 | HTML 5 |
9723 | kp.org | 11330 | 5.05 | 200 | HTML 5, English |
9724 | keap.com | 11331 | 5.05 | 200 | HTML 5, English |
9725 | webershandwick.com | 11332 | 5.05 | 200 | HTML 5, English |
9726 | addictivetips.com | 11333 | 5.05 | 200 | HTML 5, English |
9727 | christianpost.com | 11334 | 5.05 | 200 | HTML 5, English |
9728 | edreams.com | 11335 | 5.05 | 200 | HTML 5, English |
9729 | topchretien.com | 11336 | 5.05 | 200 | HTML 5 |
9730 | whatsform.com | 11337 | 5.05 | 200 | HTML 5, English |
9731 | calibre-ebook.com | 11338 | 5.05 | 200 | HTML 5, English |
9732 | grabcad.com | 11340 | 5.05 | 200 | HTML 5, English |
9733 | archlinux.org | 11341 | 5.05 | 200 | HTML 5, English |
9734 | mymsaa.org | 11342 | 5.05 | 200 | HTML 5, English |
9735 | courses.edx.org | 11343 | 5.05 | 200 | HTML 5, English |
9736 | law.berkeley.edu | 11344 | 5.05 | 200 | HTML 5, English |
9737 | pubs.er.usgs.gov | 11345 | 5.05 | 200 | HTML 5, English |
9738 | rebellion.earth | 11346 | 5.05 | 200 | HTML 5, English |
9739 | arri.com | 11347 | 5.05 | 200 | HTML 5, English |
9740 | brandequity.economictimes.indiatimes.com | 11349 | 5.05 | 200 | HTML 5, English |
9741 | delfi.lv | 11350 | 5.05 | 200 | HTML 5 |
9742 | popmatters.com | 11351 | 5.05 | 200 | HTML 5, English |
9743 | javascript.com | 11352 | 5.05 | 200 | HTML 5, English |
9744 | implicit.harvard.edu | 11354 | 5.05 | 200 | English, Strict |
9745 | coderwall.com | 11355 | 5.05 | 200 | HTML 5, English |
9746 | piwik.pro | 11356 | 5.05 | 200 | HTML 5, English |
9747 | bassistance.de | 11357 | 5.05 | 200 | HTML 5, No Lang |
9748 | uco.es | 11358 | 5.05 | 200 | HTML 5 |
9749 | alliantcreditunion.org | 11359 | 5.05 | 200 | HTML 5, English |
9750 | memory.loc.gov | 11360 | 5.05 | 200 | HTML 5, English |
9751 | remarkable.com | 11362 | 5.05 | 200 | HTML 5, English |
9752 | israelnationalnews.com | 11364 | 5.05 | 200 | HTML 5, English |
9753 | international.gc.ca | 11365 | 5.05 | 200 | HTML 5, English |
9754 | climatecentral.org | 11366 | 5.05 | 200 | HTML 5, English |
9755 | in.explara.com | 11367 | 5.05 | 200 | HTML 5, English |
9756 | nibusinessinfo.co.uk | 11368 | 5.05 | 200 | HTML 5, English |
9757 | tum.de | 11369 | 5.05 | 200 | HTML 5 |
9758 | aisel.aisnet.org | 11370 | 5.05 | 200 | HTML 5, English |
9759 | sway.com | 11371 | 5.05 | 200 | English |
9760 | journalism.co.uk | 11372 | 5.05 | 200 | HTML 5, No Lang |
9761 | nltimes.nl | 11373 | 5.05 | 200 | HTML 5, English |
9762 | rustybrick.com | 11374 | 5.05 | 200 | English, Strict |
9763 | uxbooth.com | 11375 | 5.05 | 200 | HTML 5, English |
9764 | seenthis.co | 11377 | 5.05 | 200 | HTML 5, English |
9765 | lge.com | 11378 | 5.05 | 200 | HTML 5, English |
9766 | wimbledon.com | 11380 | 5.05 | 200 | HTML 5, English |
9767 | agefotostock.com | 11381 | 5.05 | 200 | HTML 5, No Lang |
9768 | rammb.cira.colostate.edu | 11383 | 5.05 | 200 | HTML 5, English |
9769 | home.snafu.de | 11384 | 5.05 | 200 | HTML 5 |
9770 | mars.jpl.nasa.gov | 11385 | 5.05 | 200 | HTML 5, English |
9771 | news.cision.com | 11386 | 5.05 | 200 | HTML 5, No Lang |
9772 | qooh.me | 11388 | 5.05 | 200 | English, Strict |
9773 | paxful.com | 11389 | 5.05 | 200 | HTML 5, English |
9774 | visualstudio.com | 11391 | 5.05 | 200 | HTML 5, English |
9775 | math.uwaterloo.ca | 11392 | 5.05 | 200 | HTML 5, English |
9776 | joxi.ru | 11393 | 5.05 | 200 | HTML 5, No Lang |
9777 | letras.com | 11395 | 5.05 | 200 | HTML 5, English |
9778 | irma.nps.gov | 11396 | 5.05 | 200 | HTML 5, English |
9779 | hollandamerica.com | 11397 | 5.05 | 200 | HTML 5, English |
9780 | kjrh.com | 11398 | 5.05 | 200 | HTML 5, English |
9781 | writesonic.com | 11399 | 5.05 | 200 | HTML 5, English |
9782 | sharecare.com | 11400 | 5.05 | 200 | HTML 5, English |
9783 | sheffield.gov.uk | 11401 | 5.05 | 200 | HTML 5, English |
9784 | benjaminmoore.com | 11402 | 5.05 | 200 | HTML 5, English |
9785 | bandai.co.jp | 11403 | 5.05 | 200 | HTML 5 |
9786 | americanthinker.com | 11404 | 5.05 | 200 | HTML 5, No Lang |
9787 | arthritis.org | 11405 | 5.05 | 200 | HTML 5, English |
9788 | openwebanalytics.com | 11407 | 5.05 | 200 | HTML 5, English |
9789 | blockcypher.com | 11409 | 5.05 | 200 | HTML 5, English |
9790 | imi.europa.eu | 11410 | 5.05 | 200 | HTML 5, English |
9791 | industryweek.com | 11411 | 5.05 | 200 | HTML 5, English |
9792 | id.linkedin.com | 11413 | 5.05 | 200 | HTML 5 |
9793 | drei.at | 11414 | 5.05 | 200 | No Lang, Transitional |
9794 | usaultimate.org | 11415 | 5.05 | 200 | HTML 5, English |
9795 | one.one.one.one | 11416 | 5.05 | 200 | HTML 5, English |
9796 | bundesliga.com | 11417 | 5.05 | 200 | HTML 5, English |
9797 | tcl.tk | 11419 | 5.05 | 200 | No Lang, Transitional |
9798 | sfcollege.edu | 11420 | 5.05 | 200 | HTML 5, English |
9799 | dstv.com | 11421 | 5.05 | 200 | HTML 5, English |
9800 | clarity.design | 11423 | 5.05 | 200 | HTML 5, English |
Data from: Open PageRank