Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
9201 | federalnewsnetwork.com | 10717 | 5.07 | 200 | HTML 5, English |
9202 | zakon.rada.gov.ua | 10718 | 5.07 | 200 | No Lang |
9203 | cdn.mos.cms.futurecdn.net | 10719 | 5.07 | 200 | No Lang |
9204 | knightlab.northwestern.edu | 10720 | 5.07 | 200 | HTML 5, English |
9205 | truckersagainsttrafficking.org | 10721 | 5.07 | 200 | HTML 5, English |
9206 | miamidade.gov | 10722 | 5.07 | 200 | No Lang, Transitional |
9207 | gwern.net | 10723 | 5.07 | 200 | HTML 5, English |
9208 | theshiftproject.org | 10724 | 5.07 | 200 | HTML 5 |
9209 | phonepe.com | 10725 | 5.07 | 200 | HTML 5, English |
9210 | mcmaster.ca | 10726 | 5.07 | 200 | HTML 5, English |
9211 | swarmapp.com | 10728 | 5.07 | 200 | HTML 5, English |
9212 | polygon.io | 10729 | 5.07 | 200 | HTML 5, English |
9213 | rismedia.com | 10730 | 5.07 | 200 | HTML 5, English |
9214 | lexilogos.com | 10731 | 5.07 | 200 | HTML 5 |
9215 | pushpress.com | 10732 | 5.07 | 200 | HTML 5, English |
9216 | orange.es | 10733 | 5.07 | 200 | HTML 5, No Lang |
9217 | msg91.com | 10735 | 5.07 | 200 | HTML 5, English |
9218 | onefootball.com | 10736 | 5.07 | 200 | HTML 5, English |
9219 | fletcherpenney.net | 10737 | 5.07 | 200 | HTML 5, English |
9220 | gcompris.net | 10738 | 5.07 | 200 | HTML 5, English |
9221 | wiki.centos.org | 10740 | 5.07 | 200 | No Lang, Strict |
9222 | sencha.com | 10741 | 5.07 | 200 | HTML 5, English |
9223 | westwing.de | 10743 | 5.07 | 200 | HTML 5 |
9224 | be.linkedin.com | 10744 | 5.07 | 200 | HTML 5, English |
9225 | h-online.com | 10745 | 5.07 | 200 | HTML 5, English |
9226 | keele.ac.uk | 10747 | 5.07 | 200 | HTML 5, English |
9227 | fangamer.com | 10748 | 5.07 | 200 | HTML 5, English |
9228 | diposit.ub.edu | 10749 | 5.07 | 200 | HTML 5, No Lang |
9229 | crick.ac.uk | 10751 | 5.07 | 200 | HTML 5, English |
9230 | shakespeare.mit.edu | 10752 | 5.07 | 200 | No Lang, Transitional |
9231 | montclair.edu | 10753 | 5.07 | 200 | English |
9232 | nrcan.gc.ca | 10754 | 5.07 | 200 | HTML 5, English |
9233 | universetoday.com | 10755 | 5.07 | 200 | HTML 5, English |
9234 | commerce.alaska.gov | 10756 | 5.07 | 200 | HTML 5, English |
9235 | bell.ca | 10757 | 5.07 | 200 | HTML 5, English |
9236 | fattureincloud.it | 10758 | 5.07 | 200 | HTML 5 |
9237 | us-cert.cisa.gov | 10760 | 5.07 | 200 | HTML 5, English |
9238 | history.state.gov | 10761 | 5.07 | 200 | HTML 5, English |
9239 | tabelog.com | 10762 | 5.07 | 200 | HTML 5 |
9240 | tylerpaper.com | 10763 | 5.07 | 200 | HTML 5, English |
9241 | lpi.usra.edu | 10764 | 5.07 | 200 | HTML 5, English |
9242 | vccircle.com | 10765 | 5.07 | 200 | HTML 5, English |
9243 | latingrammy.com | 10766 | 5.07 | 200 | HTML 5, English |
9244 | cosmosmagazine.com | 10767 | 5.07 | 200 | HTML 5, English |
9245 | migrationpolicy.org | 10769 | 5.07 | 200 | English |
9246 | mun.ca | 10771 | 5.07 | 200 | English |
9247 | easa.europa.eu | 10772 | 5.07 | 200 | HTML 5, English |
9248 | mvnrepository.com | 10774 | 5.07 | 200 | HTML 5, English |
9249 | biospace.com | 10775 | 5.07 | 200 | HTML 5, English |
9250 | gisaid.org | 10776 | 5.07 | 200 | HTML 5, English |
9251 | aes.org | 10777 | 5.07 | 200 | HTML 5, English |
9252 | allaboutcircuits.com | 10778 | 5.07 | 200 | HTML 5, English |
9253 | community.intel.com | 10779 | 5.07 | 200 | HTML 5, English |
9254 | aftvnews.com | 10780 | 5.07 | 200 | HTML 5, English |
9255 | fibl.org | 10781 | 5.07 | 200 | HTML 5, English |
9256 | revealjs.com | 10782 | 5.07 | 200 | HTML 5, No Lang |
9257 | uic.edu | 10783 | 5.07 | 200 | HTML 5, No Lang |
9258 | graphics.stanford.edu | 10784 | 5.07 | 200 | No Lang |
9259 | visitgreece.gr | 10786 | 5.07 | 200 | HTML 5, English |
9260 | asphaltgold.com | 10787 | 5.07 | 200 | HTML 5 |
9261 | catrobat.org | 10788 | 5.07 | 200 | HTML 5, English |
9262 | fox17online.com | 10789 | 5.07 | 200 | HTML 5, English |
9263 | environment.nsw.gov.au | 10790 | 5.07 | 200 | HTML 5, English |
9264 | as.com | 10792 | 5.07 | 200 | HTML 5 |
9265 | omniglot.com | 10793 | 5.07 | 200 | HTML 5, No Lang |
9266 | uzh.ch | 10794 | 5.07 | 200 | HTML 5 |
9267 | cebp.aacrjournals.org | 10795 | 5.07 | 200 | No Lang |
9268 | eidr.org | 10796 | 5.07 | 200 | HTML 5, English |
9269 | lis.virginia.gov | 10797 | 5.07 | 200 | HTML 5, English |
9270 | qoo10.jp | 10798 | 5.07 | 200 | Transitional |
9271 | reference.wolfram.com | 10799 | 5.07 | 200 | HTML 5, English |
9272 | corporate.target.com | 10800 | 5.07 | 200 | HTML 5, English |
9273 | cpr.org | 10801 | 5.07 | 200 | HTML 5, English |
9274 | mandrill.com | 10802 | 5.07 | 200 | HTML 5, English |
9275 | thelocal.se | 10803 | 5.07 | 200 | HTML 5, English |
9276 | smosh.com | 10804 | 5.07 | 200 | HTML 5, English |
9277 | starbucks.co.uk | 10805 | 5.07 | 200 | HTML 5, English |
9278 | canonical.com | 10806 | 5.07 | 200 | HTML 5, English |
9279 | uibk.ac.at | 10807 | 5.07 | 200 | HTML 5 |
9280 | solwininfotech.com | 10808 | 5.07 | 200 | HTML 5, English |
9281 | dnr.wi.gov | 10810 | 5.07 | 200 | HTML 5, English |
9282 | almanac.httparchive.org | 10811 | 5.07 | 200 | HTML 5, English |
9283 | fsi.stanford.edu | 10812 | 5.07 | 200 | HTML 5, English |
9284 | gsk.com | 10813 | 5.07 | 200 | HTML 5, English |
9285 | chicagoreader.com | 10814 | 5.07 | 200 | HTML 5, English |
9286 | htaccesstools.com | 10815 | 5.07 | 200 | HTML 5, English |
9287 | espncricinfo.com | 10816 | 5.07 | 200 | HTML 5, English |
9288 | vecernji.hr | 10818 | 5.07 | 200 | HTML 5 |
9289 | detail.tmall.com | 10819 | 5.07 | 200 | HTML 5, No Lang |
9290 | catalog.data.gov | 10821 | 5.07 | 200 | HTML 5, English |
9291 | mix.com | 10822 | 5.07 | 200 | HTML 5, No Lang |
9292 | ratings.fide.com | 10823 | 5.07 | 200 | HTML 5, English |
9293 | dnevnik.si | 10824 | 5.07 | 200 | HTML 5 |
9294 | pir.org | 10825 | 5.07 | 200 | HTML 5, English |
9295 | onf.fr | 10826 | 5.07 | 200 | HTML 5, English |
9296 | portal.aws.amazon.com | 10827 | 5.07 | 200 | HTML 5, English |
9297 | rsw.beck.de | 10828 | 5.07 | 200 | HTML 5 |
9298 | uaudio.com | 10829 | 5.07 | 200 | HTML 5, English |
9299 | spot.colorado.edu | 10830 | 5.07 | 200 | No Lang |
9300 | selenium.dev | 10831 | 5.07 | 200 | HTML 5, English |
Data from: Open PageRank