Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
20101 | phorest.com | 23459 | 4.84 | 200 | HTML 5, English |
20102 | spacesworks.com | 23462 | 4.84 | 200 | HTML 5, English |
20103 | beyondyoga.com | 23463 | 4.84 | 200 | HTML 5, English |
20104 | bbvaopenmind.com | 23464 | 4.84 | 200 | HTML 5, English |
20105 | film1.nl | 23467 | 4.84 | 200 | HTML 5 |
20106 | artuk.org | 23468 | 4.84 | 200 | HTML 5, English |
20107 | navytimes.com | 23469 | 4.84 | 200 | HTML 5, English |
20108 | hrank.com | 23471 | 4.84 | 200 | HTML 5, English |
20109 | esahubble.org | 23472 | 4.84 | 200 | HTML 5, English |
20110 | tudocelular.com | 23473 | 4.84 | 200 | HTML 5 |
20111 | personalgenomes.org | 23474 | 4.84 | 200 | HTML 5, English |
20112 | coincodex.com | 23475 | 4.84 | 200 | HTML 5, English |
20113 | gorillaz.com | 23477 | 4.84 | 200 | HTML 5, English |
20114 | american-giant.com | 23478 | 4.84 | 200 | HTML 5, English |
20115 | ppcprotect.com | 23479 | 4.84 | 200 | HTML 5, English |
20116 | newscaststudio.com | 23480 | 4.84 | 200 | HTML 5, English |
20117 | horrordna.com | 23481 | 4.84 | 200 | HTML 5, English |
20118 | texmacs.org | 23482 | 4.84 | 200 | No Lang |
20119 | twobithistory.org | 23483 | 4.84 | 200 | HTML 5, No Lang |
20120 | westernsydney.edu.au | 23484 | 4.84 | 200 | HTML 5, English |
20121 | uclabruins.com | 23485 | 4.84 | 200 | HTML 5, English |
20122 | australian.museum | 23486 | 4.84 | 200 | HTML 5, English |
20123 | mulesoft.com | 23488 | 4.84 | 200 | HTML 5, English |
20124 | info.yahoo.com | 23490 | 4.84 | 200 | HTML 5, English |
20125 | nintendoeverything.com | 23491 | 4.84 | 200 | HTML 5, English |
20126 | flightcentre.com.au | 23492 | 4.84 | 200 | HTML 5, English |
20127 | flaglermuseum.us | 23493 | 4.84 | 200 | HTML 5, English |
20128 | chat.wordpress.org | 23494 | 4.84 | 200 | HTML 5, English |
20129 | colbertnation.com | 23495 | 4.84 | 200 | HTML 5, English |
20130 | cloudpartners.transform.microsoft.com | 23496 | 4.84 | 200 | HTML 5, English |
20131 | whas11.com | 23497 | 4.84 | 200 | HTML 5, English |
20132 | mcb.com.pk | 23498 | 4.84 | 200 | HTML 5, English |
20133 | presentandcorrect.com | 23499 | 4.84 | 200 | HTML 5, English |
20134 | traviangames.com | 23500 | 4.84 | 200 | HTML 5, English |
20135 | ucsc.edu | 23501 | 4.84 | 200 | HTML 5, English |
20136 | kuladig.de | 23502 | 4.84 | 200 | HTML 5 |
20137 | bsc.news | 23503 | 4.84 | 200 | HTML 5, No Lang |
20138 | googleblog.blogspot.de | 23504 | 4.84 | 200 | HTML 5, English |
20139 | remix.run | 23505 | 4.84 | 200 | HTML 5, English |
20140 | kunstforum.de | 23506 | 4.84 | 200 | HTML 5 |
20141 | beatsaber.com | 23507 | 4.84 | 200 | HTML 5, English |
20142 | d-id.com | 23508 | 4.84 | 200 | HTML 5, English |
20143 | airbnb.ca | 23509 | 4.84 | 200 | HTML 5, English |
20144 | polevaultweb.com | 23510 | 4.84 | 200 | HTML 5, English |
20145 | scienceblogs.de | 23511 | 4.84 | 200 | HTML 5 |
20146 | ontheworldmap.com | 23512 | 4.84 | 200 | No Lang |
20147 | atlantamagazine.com | 23513 | 4.84 | 200 | English |
20148 | siliconvalley.com | 23514 | 4.84 | 200 | HTML 5, English |
20149 | glossa-journal.org | 23515 | 4.84 | 200 | HTML 5, English |
20150 | usa.yamaha.com | 23516 | 4.84 | 200 | HTML 5, English |
20151 | fridae.asia | 23517 | 4.84 | 200 | HTML 5, English |
20152 | de-eerstelijns.nl | 23521 | 4.84 | 200 | HTML 5 |
20153 | iotforall.com | 23523 | 4.84 | 200 | HTML 5, English |
20154 | cso.org | 23525 | 4.84 | 200 | HTML 5, English |
20155 | hak.hr | 23527 | 4.84 | 200 | HTML 5 |
20156 | siue.edu | 23528 | 4.84 | 200 | HTML 5, English |
20157 | philadelphiafed.org | 23531 | 4.84 | 200 | HTML 5, English |
20158 | georgiaencyclopedia.org | 23533 | 4.84 | 200 | HTML 5, English |
20159 | apps.db.ripe.net | 23534 | 4.84 | 200 | HTML 5, No Lang |
20160 | otsimo.com | 23536 | 4.84 | 200 | HTML 5, English |
20161 | klipfolio.com | 23537 | 4.84 | 200 | HTML 5, English |
20162 | jeskola.net | 23538 | 4.84 | 200 | HTML 5, No Lang |
20163 | bitinn.net | 23539 | 4.84 | 200 | HTML 5, English |
20164 | codecguide.com | 23540 | 4.84 | 200 | No Lang, Transitional |
20165 | statsmodels.org | 23541 | 4.84 | 200 | No Lang, Transitional |
20166 | monmouth.edu | 23542 | 4.84 | 200 | HTML 5, English |
20167 | mcclatchy.com | 23543 | 4.84 | 200 | HTML 5, No Lang |
20168 | cdn.meme.am | 23544 | 4.84 | 200 | HTML 5, English |
20169 | districtcouncils.gov.hk | 23545 | 4.84 | 200 | Strict |
20170 | boxcryptor.com | 23546 | 4.84 | 200 | HTML 5, English |
20171 | nestle.com.au | 23547 | 4.84 | 200 | HTML 5, English |
20172 | demos.ayecode.io | 23548 | 4.84 | 200 | HTML 5, English |
20173 | infosecwriteups.com | 23550 | 4.84 | 200 | HTML 5, No Lang |
20174 | itead.cc | 23552 | 4.84 | 200 | HTML 5, English |
20175 | rivian.com | 23553 | 4.84 | 200 | HTML 5, English |
20176 | avahi.org | 23554 | 4.84 | 200 | No Lang |
20177 | appsero.com | 23555 | 4.84 | 200 | HTML 5, English |
20178 | aeroportidipuglia.it | 23556 | 4.84 | 200 | HTML 5 |
20179 | carm.es | 23557 | 4.84 | 200 | Transitional |
20180 | status.aws.amazon.com | 23558 | 4.84 | 200 | HTML 5, No Lang |
20181 | ajax.nl | 23559 | 4.84 | 200 | HTML 5 |
20182 | americanimmigrationcouncil.org | 23560 | 4.84 | 200 | HTML 5, English |
20183 | wp-media.me | 23561 | 4.84 | 200 | HTML 5, English |
20184 | recsys.acm.org | 23562 | 4.84 | 200 | No Lang, Strict |
20185 | seic.com | 23563 | 4.84 | 200 | HTML 5, English |
20186 | texasattorneygeneral.gov | 23564 | 4.84 | 200 | HTML 5, English |
20187 | usd.edu | 23566 | 4.84 | 200 | HTML 5, No Lang |
20188 | fold3.com | 23567 | 4.84 | 200 | HTML 5, English |
20189 | eamusic.dartmouth.edu | 23568 | 4.84 | 200 | HTML 5, English |
20190 | larochesuryon.fr | 23569 | 4.84 | 200 | HTML 5 |
20191 | widgets.weforum.org | 23570 | 4.84 | 200 | HTML 5, English |
20192 | crooked.com | 23571 | 4.84 | 200 | HTML 5, English |
20193 | sphinxsearch.com | 23572 | 4.84 | 200 | English, Strict |
20194 | marcomilesi.com | 23575 | 4.84 | 200 | HTML 5, English |
20195 | cometcache.com | 23578 | 4.84 | 200 | HTML 5, English |
20196 | webexhibits.org | 23579 | 4.84 | 200 | No Lang, Transitional |
20197 | bbcearth.com | 23580 | 4.84 | 200 | HTML 5, English |
20198 | onemilliontweetmap.com | 23581 | 4.84 | 200 | HTML 5, English |
20199 | regan.dev | 23582 | 4.84 | 200 | HTML 5, English |
20200 | socialgeek.co | 23583 | 4.84 | 200 | HTML 5, English |
Data from: Open PageRank