Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
20301 | truenas.com | 23709 | 4.84 | 200 | HTML 5, English |
20302 | skyvia.com | 23710 | 4.84 | 200 | HTML 5, English |
20303 | www1.bloomingdales.com | 23711 | 4.84 | 200 | HTML 5, English |
20304 | kidshelpline.com.au | 23712 | 4.84 | 200 | HTML 5, English |
20305 | muslimcentral.com | 23713 | 4.84 | 200 | HTML 5, English |
20306 | kew.org | 23714 | 4.84 | 200 | HTML 5, English |
20307 | weboftrust.info | 23715 | 4.84 | 200 | HTML 5, English |
20308 | encyclopediaofmath.org | 23716 | 4.84 | 200 | HTML 5, English |
20309 | asumetech.com | 23717 | 4.84 | 200 | HTML 5, English |
20310 | useit.com | 23718 | 4.84 | 200 | HTML 5, English |
20311 | iarpa.gov | 23719 | 4.84 | 200 | HTML 5, English |
20312 | ypf.com | 23720 | 4.84 | 200 | HTML 5 |
20313 | wayback.archive.org | 23721 | 4.84 | 200 | HTML 5, English |
20314 | americanheritage.com | 23722 | 4.84 | 200 | HTML 5, English |
20315 | daily.bandcamp.com | 23723 | 4.84 | 200 | HTML 5, No Lang |
20316 | docs.metabox.io | 23724 | 4.84 | 200 | HTML 5, English |
20317 | corbanworks.com | 23726 | 4.84 | 200 | HTML 5, English |
20318 | linguistics.ucla.edu | 23727 | 4.84 | 200 | HTML 5, English |
20319 | royanews.tv | 23728 | 4.84 | 200 | HTML 5, English |
20320 | ow2.org | 23729 | 4.84 | 200 | HTML 5, English |
20321 | tandemdiabetes.com | 23730 | 4.84 | 200 | HTML 5, English |
20322 | vietworldkitchen.com | 23731 | 4.84 | 200 | HTML 5, English |
20323 | law.moj.gov.tw | 23732 | 4.84 | 200 | HTML 5 |
20324 | redq.io | 23733 | 4.84 | 200 | HTML 5, English |
20325 | mixup.com.mx | 23734 | 4.84 | 200 | No Lang |
20326 | vozpopuli.com | 23735 | 4.84 | 200 | Transitional |
20327 | conrad.com | 23736 | 4.84 | 200 | HTML 5, English |
20328 | ryanve.com | 23737 | 4.84 | 200 | HTML 5, English |
20329 | businessresearchinsights.com | 23738 | 4.84 | 200 | HTML 5, English |
20330 | milanote.com | 23739 | 4.84 | 200 | HTML 5, English |
20331 | poetryarchive.org | 23740 | 4.84 | 200 | HTML 5, English |
20332 | cacr.uwaterloo.ca | 23741 | 4.84 | 200 | No Lang |
20333 | atelier.net | 23742 | 4.84 | 200 | HTML 5, English |
20334 | clyp.it | 23743 | 4.84 | 200 | HTML 5, No Lang |
20335 | lupa.cz | 23744 | 4.84 | 200 | HTML 5 |
20336 | csicop.org | 23745 | 4.84 | 200 | HTML 5, No Lang |
20337 | mediafax.ro | 23746 | 4.84 | 200 | HTML 5 |
20338 | tatvic.com | 23747 | 4.84 | 200 | HTML 5, English |
20339 | partnermarketinghub.withgoogle.com | 23748 | 4.84 | 200 | HTML 5, English |
20340 | cedaro.com | 23749 | 4.84 | 200 | HTML 5, English |
20341 | jjj.blog | 23750 | 4.84 | 200 | HTML 5, English |
20342 | docs.itthinx.com | 23751 | 4.84 | 200 | HTML 5, English |
20343 | dlxplugins.com | 23752 | 4.84 | 200 | HTML 5, English |
20344 | ilghera.com | 23753 | 4.84 | 200 | HTML 5, English |
20345 | demo.tiptoppress.com | 23754 | 4.84 | 200 | HTML 5, English |
20346 | docs.joedolson.com | 23756 | 4.84 | 200 | HTML 5, English |
20347 | thechurchnews.com | 23757 | 4.84 | 200 | HTML 5, English |
20348 | qualitysafety.bmj.com | 23758 | 4.84 | 200 | HTML 5, English |
20349 | crd.lbl.gov | 23759 | 4.84 | 200 | HTML 5, English |
20350 | tuni.fi | 23760 | 4.84 | 200 | HTML 5 |
20351 | cgd.ucar.edu | 23761 | 4.84 | 200 | HTML 5, English |
20352 | dcmp.org | 23762 | 4.84 | 200 | HTML 5, English |
20353 | health.pa.gov | 23763 | 4.84 | 200 | HTML 5, English |
20354 | bigissue.com | 23764 | 4.84 | 200 | HTML 5, English |
20355 | darkhorse.com | 23765 | 4.84 | 200 | HTML 5, English |
20356 | randsinrepose.com | 23768 | 4.84 | 200 | HTML 5, English |
20357 | engineering.stanford.edu | 23770 | 4.84 | 200 | HTML 5, English |
20358 | clintonfoundation.org | 23771 | 4.84 | 200 | HTML 5, English |
20359 | opengovpartnership.org | 23772 | 4.84 | 200 | HTML 5, English |
20360 | ftp.cs.wisc.edu | 23773 | 4.84 | 200 | No Lang |
20361 | help.micro.blog | 23774 | 4.84 | 200 | HTML 5, English |
20362 | gettyimages.co.nz | 23775 | 4.84 | 200 | HTML 5, English |
20363 | greenbytes.de | 23777 | 4.84 | 200 | No Lang |
20364 | wesh.com | 23778 | 4.84 | 200 | HTML 5, English |
20365 | job-boards.greenhouse.io | 23780 | 4.84 | 200 | HTML 5, English |
20366 | weser-kurier.de | 23781 | 4.84 | 200 | HTML 5 |
20367 | defenseromania.ro | 23782 | 4.84 | 200 | HTML 5 |
20368 | blog.risingstack.com | 23783 | 4.84 | 200 | HTML 5, English |
20369 | shipt.com | 23785 | 4.84 | 200 | HTML 5, English |
20370 | grilld.com.au | 23786 | 4.84 | 200 | HTML 5, English |
20371 | cnnpressroom.blogs.cnn.com | 23788 | 4.84 | 200 | HTML 5, English |
20372 | thesslstore.com | 23789 | 4.84 | 200 | HTML 5, English |
20373 | idemitsu.com | 23790 | 4.84 | 200 | HTML 5 |
20374 | development.azurecurve.co.uk | 23791 | 4.84 | 200 | HTML 5, English |
20375 | riseup.net | 23792 | 4.84 | 200 | HTML 5, English |
20376 | kowsarhossain.com | 23793 | 4.84 | 200 | HTML 5, English |
20377 | weekendavisen.dk | 23795 | 4.84 | 200 | HTML 5 |
20378 | earmaster.com | 23796 | 4.84 | 200 | HTML 5, English |
20379 | hedera.com | 23797 | 4.84 | 200 | HTML 5, English |
20380 | thestaffcanteen.com | 23798 | 4.84 | 200 | HTML 5, No Lang |
20381 | swp.de | 23799 | 4.84 | 200 | HTML 5 |
20382 | lclark.edu | 23800 | 4.84 | 200 | HTML 5, English |
20383 | unitedbiblesocieties.org | 23801 | 4.84 | 200 | HTML 5, English |
20384 | jhpiego.org | 23802 | 4.84 | 200 | HTML 5, English |
20385 | fernandobriano.com | 23803 | 4.84 | 200 | HTML 5, No Lang |
20386 | outboundengine.com | 23804 | 4.84 | 200 | HTML 5, English |
20387 | predictit.org | 23805 | 4.84 | 200 | HTML 5, No Lang |
20388 | burton.com | 23806 | 4.84 | 200 | HTML 5, English |
20389 | english.hani.co.kr | 23807 | 4.84 | 200 | HTML 5 |
20390 | pocket-change.jp | 23808 | 4.84 | 200 | HTML 5, English |
20391 | eevblog.com | 23810 | 4.84 | 200 | HTML 5, English |
20392 | friends.pods.io | 23811 | 4.84 | 200 | HTML 5, English |
20393 | in4s.net | 23814 | 4.84 | 200 | HTML 5 |
20394 | kyivstar.ua | 23815 | 4.84 | 200 | HTML 5 |
20395 | kitchensanctuary.com | 23817 | 4.84 | 200 | HTML 5, English |
20396 | news.gov.hk | 23818 | 4.84 | 200 | Transitional |
20397 | presstelegram.com | 23819 | 4.84 | 200 | HTML 5, English |
20398 | zim-wiki.org | 23821 | 4.84 | 200 | No Lang, Transitional |
20399 | cabinetoffice.gov.uk | 23822 | 4.84 | 200 | HTML 5, English |
20400 | geckoboard.com | 23823 | 4.84 | 200 | HTML 5, English |
Data from: Open PageRank