Top Domains
Question

Table populated from a list of the top 10M most popular websites.

  • List Quality
    • This list is ranked by page rank rather than traffic
      • May include inactive or redirected sites
      • Does not reflect actual traffic or views
    • Other lists that might be interesting:
      • https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
      • https://www.commoncrawl.org/
      • https://tranco-list.eu/

  • Not that concerned whether this list is exact
    • It should provide a good sampling of top sites
    • Order really isn't important in this case
  • This domain is decades old and not in the list!
    • It doesn't get much traffic, but ranked below 10M?
  • Good performance test of a database table with 10M rows
    • Cold loads take several seconds when the data is paged out
    • Well indexed but not performing as well as expected
      • Much larger tables have performed much better
      • Appears to be doing a full table scan
    • This database is running in SQL Server under Docker.
    • Will load the same dataset into PostgreSQL for comparison.
  • Started crawling the home page of the first 1M domains.
    • Interested in stats on use of HTML5, proper HTML, etc.
    • Started with the first 100K sites and expanding to the first 1M.
  • Observations
    • Surprising number of domains without a proper HTML lang attribute
    • Surprising number of domains not using a proper HTML5 doctype
    • The domains are listed naked, without a fully qualified hostname.
    • Many domains don't have a DNS entry for the naked domain.
      • domain.com vs www.domain.com; the naked domain should redirect.
    • Surprising number of SSL errors on the naked domain
      • Clients can't connect without disabling SSL verification
      • With standard security checks, clients will never get a redirect
      • Some domains redirect, but behind an invalid SSL certificate
        • This is easy to fix, or the domain can be parked to handle redirects.
        • That is lost revenue, given how much linking it takes to get on this list
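
The naked-domain observations above can be reproduced with a small probe. This is a minimal sketch, not the crawler used for this page: the function names `candidates` and `check_host` and the four status strings are made up for illustration. It assumes HTTPS on port 443 and uses Python's default certificate and hostname verification, so a host with an invalid certificate is reported as an error rather than followed to its redirect.

```python
import socket
import ssl


def candidates(domain):
    """Return the naked domain followed by its www. variant."""
    return [domain, "www." + domain]


def check_host(host, port=443, timeout=5.0):
    """Classify a host as 'no-dns', 'unreachable', 'ssl-error', or 'ok'.

    These labels are hypothetical; they are not the Status values in the table.
    """
    try:
        socket.getaddrinfo(host, port)
    except socket.gaierror:
        return "no-dns"  # no DNS entry, or the resolver itself failed

    # The default context verifies the certificate chain and the hostname.
    context = ssl.create_default_context()
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host):
                return "ok"  # TLS handshake succeeded with verification on
    except ssl.SSLError:
        return "ssl-error"  # includes certificate verification failures
    except OSError:
        return "unreachable"  # refused, timed out, or similar
```

Calling `check_host` on each entry of `candidates("domain.com")` shows whether only the www. variant is reachable. A result of "ssl-error" on the naked domain is the case where a client with standard checks never sees the redirect.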

Selected Top Domains

# | Domain | Sort | Rank | Status | Flags
4501 | lexico.com | 5327 | 5.33 | 200 | HTML 5, English
4502 | naver.com | 5328 | 5.33 | 200 | HTML 5
4503 | zend.com | 5329 | 5.33 | 200 | HTML 5, English
4504 | tpwd.texas.gov | 5330 | 5.33 | 200 | HTML 5, English
4505 | wpastra.com | 5332 | 5.33 | 200 | HTML 5, English
4506 | r-project.org | 5334 | 5.33 | 200 | HTML 5, English
4507 | thenationalnews.com | 5335 | 5.33 | 200 | HTML 5, English
4508 | embed.ly | 5336 | 5.33 | 200 | HTML 5, English
4509 | fdc.nal.usda.gov | 5337 | 5.33 | 200 | HTML 5, English
4510 | budgetbytes.com | 5338 | 5.33 | 200 | HTML 5, English
4511 | profile.ameba.jp | 5339 | 5.33 | 200 | HTML 5
4512 | whova.com | 5340 | 5.33 | 200 | HTML 5, English
4513 | aclweb.org | 5341 | 5.33 | 200 | English
4514 | element.io | 5342 | 5.33 | 200 | No Lang
4515 | happycow.net | 5343 | 5.33 | 200 | HTML 5, English
4516 | holytrinityorthodox.com | 5344 | 5.33 | 200 | No Lang, Transitional
4517 | baynews9.com | 5345 | 5.33 | 200 | HTML 5, English
4518 | weblineindia.com | 5346 | 5.33 | 200 | HTML 5, English
4519 | cash.me | 5347 | 5.33 | 200 | HTML 5, English
4520 | flowcode.com | 5348 | 5.33 | 200 | HTML 5, English
4521 | post.japanpost.jp | 5349 | 5.33 | 200 | HTML 5
4522 | pandas.pydata.org | 5350 | 5.33 | 200 | HTML 5, No Lang
4523 | pix11.com | 5351 | 5.33 | 200 | HTML 5, English
4524 | cs.cornell.edu | 5352 | 5.33 | 200 | HTML 5, English
4525 | active.com | 5354 | 5.33 | 200 | HTML 5, English
4526 | garanteprivacy.it | 5355 | 5.33 | 200 | HTML 5
4527 | careers.google.com | 5356 | 5.33 | 200 | HTML 5, English
4528 | gpg4win.org | 5357 | 5.33 | 200 | No Lang, Strict
4529 | time.is | 5358 | 5.33 | 200 | HTML 5, English
4530 | rti.org | 5359 | 5.33 | 200 | HTML 5, English
4531 | sonarsource.com | 5360 | 5.33 | 200 | HTML 5, English
4532 | threema.ch | 5361 | 5.33 | 200 | HTML 5, English
4533 | convinceandconvert.com | 5362 | 5.33 | 200 | HTML 5, English
4534 | datacenterknowledge.com | 5363 | 5.33 | 200 | HTML 5, English
4535 | nespresso.com | 5365 | 5.33 | 200 | HTML 5, English
4536 | en.wikiquote.org | 5366 | 5.33 | 200 | HTML 5, No Lang
4537 | radar.oreilly.com | 5367 | 5.33 | 200 | HTML 5, English
4538 | merchants.google.com | 5369 | 5.33 | 200 | HTML 5, English
4539 | psmag.com | 5371 | 5.33 | 200 | HTML 5, English
4540 | badoo.com | 5372 | 5.33 | 200 | HTML 5, English
4541 | evo.com | 5374 | 5.33 | 200 | HTML 5, English
4542 | shows.acast.com | 5375 | 5.33 | 200 | HTML 5, English
4543 | rochester.edu | 5376 | 5.33 | 200 | HTML 5, English
4544 | datawrapper.dwcdn.net | 5377 | 5.33 | 200 | No Lang
4545 | pacsun.com | 5378 | 5.33 | 200 | HTML 5, English
4546 | google.github.io | 5379 | 5.33 | 200 | HTML 5, No Lang
4547 | lifehacker.com.au | 5380 | 5.33 | 200 | HTML 5, English
4548 | kennedyspacecenter.com | 5381 | 5.33 | 200 | HTML 5, English
4549 | draft.blogger.com | 5384 | 5.33 | 200 | HTML 5, English
4550 | momoyoga.com | 5385 | 5.33 | 200 | HTML 5, English
4551 | google.com.vn | 5386 | 5.33 | 200 | HTML 5, English
4552 | aeon.co | 5388 | 5.33 | 200 | HTML 5, English
4553 | globaltimes.cn | 5389 | 5.33 | 200 | HTML 5, English
4554 | poedit.net | 5392 | 5.33 | 200 | HTML 5, English
4555 | cloudup.com | 5393 | 5.33 | 200 | HTML 5, No Lang
4556 | breuninger.com | 5394 | 5.33 | 200 | HTML 5, English
4557 | losangeles.cbslocal.com | 5395 | 5.33 | 200 | HTML 5, English
4558 | gov.mb.ca | 5397 | 5.33 | 200 | HTML 5, English
4559 | clockify.me | 5398 | 5.33 | 200 | HTML 5, No Lang
4560 | pinterest.ch | 5400 | 5.33 | 200 | HTML 5, English
4561 | appadvice.com | 5401 | 5.33 | 200 | HTML 5, No Lang
4562 | tradingeconomics.com | 5402 | 5.33 | 200 | HTML 5, No Lang
4563 | sebrae.com.br | 5403 | 5.33 | 200 | HTML 5
4564 | idokep.hu | 5404 | 5.33 | 200 | HTML 5
4565 | ru.linkedin.com | 5405 | 5.33 | 200 | HTML 5
4566 | icons.getbootstrap.com | 5406 | 5.33 | 200 | HTML 5, English
4567 | haaretz.co.il | 5408 | 5.33 | 200 | HTML 5
4568 | google-latlong.blogspot.com | 5409 | 5.33 | 200 | HTML 5, English
4569 | grafana.com | 5410 | 5.33 | 200 | HTML 5, English
4570 | 21.edu.ar | 5411 | 5.33 | 200 | HTML 5
4571 | yankodesign.com | 5412 | 5.33 | 200 | HTML 5, English
4572 | it.pinterest.com | 5413 | 5.33 | 200 | HTML 5, English
4573 | uptodate.com | 5414 | 5.33 | 200 | HTML 5, No Lang
4574 | cloud.tencent.com | 5415 | 5.33 | 200 | HTML 5
4575 | onezero.medium.com | 5416 | 5.33 | 200 | HTML 5, English
4576 | encyclopedia.com | 5417 | 5.33 | 200 | HTML 5, English
4577 | bitcatcha.com | 5418 | 5.33 | 200 | HTML 5, English
4578 | chicago.cbslocal.com | 5419 | 5.33 | 200 | HTML 5, English
4579 | raphkoster.com | 5421 | 5.33 | 200 | HTML 5, English
4580 | huffingtonpost.ca | 5422 | 5.33 | 200 | HTML 5, English
4581 | ally.com | 5423 | 5.33 | 200 | HTML 5, English
4582 | sympla.com.br | 5424 | 5.33 | 200 | HTML 5
4583 | sway.office.com | 5425 | 5.33 | 200 | English
4584 | brightside.me | 5426 | 5.33 | 200 | HTML 5, English
4585 | fire.ca.gov | 5427 | 5.33 | 200 | HTML 5, English
4586 | scotthelme.co.uk | 5428 | 5.33 | 200 | HTML 5, English
4587 | themarginalian.org | 5429 | 5.33 | 200 | HTML 5, English
4588 | usability.gov | 5430 | 5.33 | 200 | No Lang
4589 | generatepress.com | 5431 | 5.33 | 200 | HTML 5, English
4590 | openclassrooms.com | 5432 | 5.33 | 200 | HTML 5, English
4591 | blogs.msdn.microsoft.com | 5433 | 5.33 | 200 | HTML 5, English
4592 | chicagobooth.edu | 5434 | 5.33 | 200 | HTML 5, English
4593 | apoia.se | 5435 | 5.33 | 200 | HTML 5, No Lang
4594 | ushmm.org | 5436 | 5.33 | 200 | HTML 5, English
4595 | synology.com | 5437 | 5.33 | 200 | HTML 5, English
4596 | ics.uci.edu | 5438 | 5.33 | 200 | HTML 5, English
4597 | mamamia.com.au | 5439 | 5.33 | 200 | HTML 5, English
4598 | lexpress.fr | 5440 | 5.33 | 200 | HTML 5
4599 | chemistryworld.com | 5441 | 5.32 | 200 | HTML 5, English
4600 | newatlas.com | 5442 | 5.32 | 200 | HTML 5, English
Data from: Open PageRank
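
The Flags values in the table above (HTML 5, English, No Lang, Transitional, Strict) look like they are derived from each home page's doctype and html lang attribute. How this page actually computes them is not stated, so the following is only a guess at the rules, written with Python's standard html.parser. The function name `flags_for` and the matching rules are assumptions.

```python
from html.parser import HTMLParser


class _PageInfo(HTMLParser):
    """Collect the first doctype and the lang attribute of the first <html> tag."""

    def __init__(self):
        super().__init__()
        self.doctype = None
        self.lang = None
        self.seen_html = False

    def handle_decl(self, decl):
        # decl is the text between "<!" and ">", e.g. "DOCTYPE html"
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.lower().split())

    def handle_starttag(self, tag, attrs):
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


def flags_for(html_text):
    """Return a Flags-style string for one page (assumed rules, see lead-in)."""
    info = _PageInfo()
    info.feed(html_text)
    doctype = info.doctype or ""
    lang = (info.lang or "").strip().lower()

    flags = []
    if doctype == "doctype html":
        flags.append("HTML 5")
    if not lang:
        flags.append("No Lang")
    elif lang == "en" or lang.startswith("en-"):
        flags.append("English")
    # Older doctypes are recognized by the DTD name in the declaration.
    if "transitional" in doctype:
        flags.append("Transitional")
    elif "strict" in doctype:
        flags.append("Strict")
    return ", ".join(flags)
```

Under these rules, a page with an HTML5 doctype and a non-English lang value gets only "HTML 5", which is consistent with rows such as naver.com in the table.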