Table filled in from the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than traffic
- May include both inactive and redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Doesn't get much traffic, but ranked beyond 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on the use of HTML5, valid HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The listed domains are naked, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com; the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
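
The naked-domain observations above can be checked with a small probe. This is a minimal sketch, not the crawler used for these notes: `candidate_urls` and `probe` are hypothetical names, and it assumes HTTPS on the default port and the Python standard library only. It tries each URL with normal certificate checks and, optionally, with verification dropped, so a site that only redirects behind an invalid cert becomes visible.

```python
import ssl
import urllib.error
import urllib.request


def candidate_urls(domain: str) -> list[str]:
    """Return the naked and www HTTPS URLs to try for a listed domain."""
    domain = domain.strip().lower().rstrip(".")
    urls = [f"https://{domain}/"]
    if not domain.startswith("www."):
        urls.append(f"https://www.{domain}/")
    return urls


def probe(url: str, verify: bool = True, timeout: float = 10.0) -> str:
    """Fetch a URL and summarize the outcome as a short string."""
    context = ssl.create_default_context()
    if not verify:
        # Measurement only: this drops the checks a browser would enforce.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    try:
        with urllib.request.urlopen(url, timeout=timeout, context=context) as resp:
            # geturl() is the final URL after any redirects were followed.
            return f"{resp.status} -> {resp.geturl()}"
    except urllib.error.HTTPError as exc:
        return f"http error: {exc.code}"
    except urllib.error.URLError as exc:
        if isinstance(exc.reason, ssl.SSLError):
            return f"ssl error: {exc.reason}"
        return f"failed: {exc.reason}"
    except (OSError, ValueError) as exc:
        # Timeouts and malformed URLs end up here.
        return f"failed: {exc}"
```

Calling `probe(url)` and then `probe(url, verify=False)` for each entry of `candidate_urls("domain.com")` shows the difference: an `ssl error` on the first call and a status with a redirected URL on the second is the case where a standard client never sees the redirect.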
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
15501 | timescolonist.com | 18063 | 4.92 | 200 | HTML 5, English |
15502 | meljoulwan.com | 18065 | 4.92 | 200 | HTML 5, No Lang |
15503 | 4gamer.net | 18066 | 4.92 | 200 | Transitional |
15504 | zemanta.com | 18067 | 4.92 | 200 | HTML 5, English |
15505 | encyclopedia.ushmm.org | 18068 | 4.92 | 200 | HTML 5, English |
15506 | upmetrics.co | 18069 | 4.92 | 200 | HTML 5, English |
15507 | monaca.io | 18070 | 4.92 | 200 | HTML 5, English |
15508 | cise.ufl.edu | 18071 | 4.92 | 200 | HTML 5, English |
15509 | lootcrate.com | 18074 | 4.92 | 200 | HTML 5, English |
15510 | us.blackberry.com | 18075 | 4.92 | 200 | HTML 5, English |
15511 | lemelson.mit.edu | 18076 | 4.92 | 200 | HTML 5, English |
15512 | privacycg.github.io | 18077 | 4.92 | 200 | HTML 5, English |
15513 | saudia.com | 18078 | 4.92 | 200 | No Lang |
15514 | din.de | 18079 | 4.92 | 200 | HTML 5 |
15515 | brella.io | 18080 | 4.92 | 200 | HTML 5, English |
15516 | 13.cl | 18081 | 4.92 | 200 | HTML 5 |
15517 | soledadpenades.com | 18083 | 4.92 | 200 | HTML 5, No Lang |
15518 | scholar.googleusercontent.com | 18085 | 4.92 | 200 | HTML 5, No Lang |
15519 | medstarhealth.org | 18086 | 4.92 | 200 | HTML 5, No Lang |
15520 | cafebazaar.ir | 18087 | 4.92 | 200 | HTML 5 |
15521 | scielo.org.za | 18088 | 4.92 | 200 | No Lang |
15522 | veterans.gc.ca | 18089 | 4.92 | 200 | HTML 5, English |
15523 | biv.com | 18090 | 4.92 | 200 | HTML 5, English |
15524 | experian.co.uk | 18091 | 4.92 | 200 | HTML 5, English |
15525 | ror.org | 18092 | 4.92 | 200 | HTML 5, English |
15526 | prospectmagazine.co.uk | 18094 | 4.92 | 200 | HTML 5, English |
15527 | globus.org | 18096 | 4.92 | 200 | HTML 5, No Lang |
15528 | okayplayer.com | 18097 | 4.92 | 200 | HTML 5, English |
15529 | gizmag.com | 18098 | 4.92 | 200 | HTML 5, English |
15530 | ccny.cuny.edu | 18099 | 4.92 | 200 | HTML 5, English |
15531 | movableink.com | 18100 | 4.92 | 200 | HTML 5, English |
15532 | nationaljournal.com | 18101 | 4.92 | 200 | HTML 5, English |
15533 | foodbusinessnews.net | 18102 | 4.92 | 200 | HTML 5, English |
15534 | gearthblog.com | 18103 | 4.92 | 200 | HTML 5, English |
15535 | api.semanticscholar.org | 18104 | 4.92 | 200 | HTML 5, English |
15536 | radiopublic.com | 18105 | 4.92 | 200 | HTML 5, English |
15537 | australianunity.com.au | 18107 | 4.92 | 200 | HTML 5, English |
15538 | cassinfo.com | 18108 | 4.92 | 200 | HTML 5, English |
15539 | vegrecipesofindia.com | 18109 | 4.92 | 200 | HTML 5, English |
15540 | objectcache.pro | 18110 | 4.92 | 200 | HTML 5, No Lang |
15541 | klove.com | 18111 | 4.92 | 200 | HTML 5, English |
15542 | albawaba.com | 18112 | 4.92 | 200 | HTML 5, English |
15543 | bankier.pl | 18113 | 4.92 | 200 | HTML 5 |
15544 | turbogears.org | 18114 | 4.92 | 200 | No Lang, Transitional |
15545 | unav.edu | 18115 | 4.92 | 200 | HTML 5 |
15546 | unocha.org | 18116 | 4.92 | 200 | HTML 5, English |
15547 | aacu.org | 18117 | 4.92 | 200 | HTML 5, English |
15548 | jamaica-gleaner.com | 18118 | 4.92 | 200 | English |
15549 | ilmeteo.it | 18119 | 4.92 | 200 | |
15550 | extra-life.org | 18121 | 4.92 | 200 | HTML 5, English |
15551 | jp.linkedin.com | 18123 | 4.92 | 200 | HTML 5 |
15552 | hmc.edu | 18124 | 4.92 | 200 | HTML 5, English |
15553 | apigen.org | 18126 | 4.92 | 200 | HTML 5 |
15554 | workstem.com | 18127 | 4.92 | 200 | HTML 5 |
15555 | simkl.com | 18128 | 4.92 | 200 | HTML 5, No Lang |
15556 | animate.co.jp | 18129 | 4.92 | 200 | HTML 5 |
15557 | mclaren.com | 18130 | 4.92 | 200 | HTML 5, English |
15558 | uk.style.yahoo.com | 18131 | 4.92 | 200 | HTML 5, No Lang |
15559 | bsky.social | 18132 | 4.92 | 200 | HTML 5, English |
15560 | chamberofcommerce.com | 18133 | 4.92 | 200 | HTML 5, English |
15561 | developercommunity.visualstudio.com | 18134 | 4.92 | 200 | No Lang |
15562 | thecity.nyc | 18135 | 4.92 | 200 | HTML 5, English |
15563 | berliner-ensemble.de | 18136 | 4.92 | 200 | |
15564 | dca.ca.gov | 18137 | 4.92 | 200 | HTML 5, English |
15565 | cs.technion.ac.il | 18138 | 4.92 | 200 | HTML 5, English |
15566 | wienerzeitung.at | 18139 | 4.92 | 200 | HTML 5 |
15567 | hema.nl | 18140 | 4.92 | 200 | HTML 5 |
15568 | collectorsweekly.com | 18141 | 4.92 | 200 | HTML 5, English |
15569 | moodycenteratx.com | 18142 | 4.92 | 200 | HTML 5, English |
15570 | thriftytraveler.com | 18143 | 4.92 | 200 | HTML 5, English |
15571 | stockcharts.com | 18144 | 4.92 | 200 | HTML 5, English |
15572 | comsoc.org | 18145 | 4.92 | 200 | HTML 5, English |
15573 | vcstar.com | 18147 | 4.92 | 200 | HTML 5, English |
15574 | designernews.co | 18148 | 4.92 | 200 | HTML 5, English |
15575 | workspot.com | 18150 | 4.92 | 200 | HTML 5, English |
15576 | wallet.google | 18151 | 4.92 | 200 | HTML 5, English |
15577 | fox6now.com | 18152 | 4.92 | 200 | HTML 5, English |
15578 | scarymommy.com | 18154 | 4.92 | 200 | HTML 5, English |
15579 | infrequently.org | 18155 | 4.92 | 200 | HTML 5, English |
15580 | bjgp.org | 18156 | 4.92 | 200 | HTML 5, English |
15581 | webaxe.org | 18157 | 4.92 | 200 | HTML 5, English |
15582 | linphone.org | 18158 | 4.92 | 200 | HTML 5 |
15583 | transformativeworks.org | 18159 | 4.92 | 200 | HTML 5, English |
15584 | cuh.nhs.uk | 18160 | 4.92 | 200 | HTML 5, English |
15585 | hoyolab.com | 18161 | 4.92 | 200 | HTML 5, No Lang |
15586 | socialsecurity.gov | 18162 | 4.92 | 200 | HTML 5, English |
15587 | blogoscoped.com | 18165 | 4.92 | 200 | English, Strict |
15588 | nyc.streetsblog.org | 18166 | 4.92 | 200 | HTML 5, English |
15589 | s3-ap-northeast-1.amazonaws.com | 18167 | 4.92 | 200 | HTML 5, English |
15590 | schloesserland-sachsen.de | 18169 | 4.92 | 200 | HTML 5 |
15591 | blog.angular.io | 18171 | 4.92 | 200 | HTML 5, English |
15592 | restaurant.org | 18172 | 4.92 | 200 | HTML 5, English |
15593 | nordiskamuseet.se | 18173 | 4.92 | 200 | HTML 5 |
15594 | blogs.salesforce.com | 18174 | 4.92 | 200 | HTML 5, English |
15595 | apc.org | 18175 | 4.92 | 200 | HTML 5, English |
15596 | dl.gi.de | 18176 | 4.92 | 200 | HTML 5 |
15597 | gendai.ismedia.jp | 18177 | 4.92 | 200 | HTML 5 |
15598 | nbp.pl | 18178 | 4.92 | 200 | HTML 5 |
15599 | smallarmssurvey.org | 18179 | 4.92 | 200 | HTML 5, English |
15600 | wdrb.com | 18180 | 4.92 | 200 | HTML 5, English |
Data from: Open PageRank
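
The Flags column can be approximated from a fetched home page with a few regular expressions. This is a rough sketch under assumptions: the flag names come from the table, but the rules behind them are not stated in these notes, so the logic below (an exact `<!DOCTYPE html>` means "HTML 5", a `lang` attribute starting with `en` means "English", a missing `lang` on `<html>` means "No Lang") is a guess, and `classify_flags` is a hypothetical name.

```python
import re

DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# The lookbehind skips attributes such as xml:lang or hreflang.
LANG_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def classify_flags(html: str) -> list[str]:
    """Return flags like those in the table (assumed rules, see lead-in)."""
    flags = []
    legacy = None

    doctype = DOCTYPE_RE.search(html)
    if doctype:
        decl = doctype.group(1).strip().lower()
        if decl == "html":
            flags.append("HTML 5")
        elif "transitional" in decl:
            legacy = "Transitional"
        elif "strict" in decl:
            legacy = "Strict"

    tag = HTML_TAG_RE.search(html)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().split("-")[0] == "en":
        flags.append("English")
    # A non-English lang value adds no language flag.

    if legacy:
        flags.append(legacy)
    return flags
```

A regex pass like this is cheap enough for a million pages but will misread unusual markup; a real HTML parser would be the safer choice where accuracy matters more than speed.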