Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
10001 | gty.org | 11651 | 5.05 | 200 | HTML 5, No Lang |
10002 | commercialappeal.com | 11652 | 5.05 | 200 | HTML 5, English |
10003 | dynu.com | 11653 | 5.05 | 200 | HTML 5, English |
10004 | escapistmagazine.com | 11654 | 5.05 | 200 | HTML 5, English |
10005 | uselectionatlas.org | 11656 | 5.05 | 200 | No Lang, Strict |
10006 | toyota.jp | 11657 | 5.05 | 200 | HTML 5 |
10007 | c2pa.org | 11658 | 5.05 | 200 | HTML 5, English |
10008 | sega.co.jp | 11659 | 5.05 | 200 | HTML 5 |
10009 | killedbygoogle.com | 11660 | 5.05 | 200 | HTML 5, No Lang |
10010 | purl.stanford.edu | 11662 | 5.05 | 200 | HTML 5, English |
10011 | payrexx.com | 11663 | 5.05 | 200 | HTML 5, English |
10012 | lirias.kuleuven.be | 11664 | 5.05 | 200 | HTML 5, English |
10013 | ndl.go.jp | 11665 | 5.05 | 200 | Transitional |
10014 | mtl.org | 11666 | 5.05 | 200 | HTML 5, English |
10015 | tagblatt.ch | 11667 | 5.05 | 200 | HTML 5 |
10016 | animatedknots.com | 11668 | 5.05 | 200 | HTML 5, English |
10017 | urn.fi | 11669 | 5.05 | 200 | HTML 5, No Lang |
10018 | supportforums.cisco.com | 11670 | 5.05 | 200 | HTML 5, English |
10019 | shodan.io | 11671 | 5.05 | 200 | HTML 5, English |
10020 | uc.edu | 11672 | 5.05 | 200 | HTML 5, English |
10021 | scaledagileframework.com | 11674 | 5.05 | 200 | HTML 5, English |
10022 | davidrumsey.com | 11676 | 5.05 | 200 | HTML 5, English |
10023 | cryptoslate.com | 11677 | 5.05 | 200 | HTML 5, English |
10024 | nbcphiladelphia.com | 11678 | 5.05 | 200 | HTML 5, English |
10025 | momontimeout.com | 11679 | 5.05 | 200 | HTML 5, English |
10026 | redmine.org | 11680 | 5.05 | 200 | HTML 5, English |
10027 | globaledge.msu.edu | 11682 | 5.05 | 200 | HTML 5, English |
10028 | in2013dollars.com | 11683 | 5.05 | 200 | HTML 5, English |
10029 | magzter.com | 11684 | 5.05 | 200 | HTML 5, English |
10030 | learn24bd.com | 11685 | 5.05 | 200 | HTML 5, English |
10031 | defensa.com | 11686 | 5.05 | 200 | HTML 5, No Lang |
10032 | phdcomics.com | 11687 | 5.05 | 200 | No Lang |
10033 | community.algolia.com | 11688 | 5.05 | 200 | HTML 5, English |
10034 | support.1password.com | 11689 | 5.05 | 200 | HTML 5, English |
10035 | google.co.ke | 11690 | 5.05 | 200 | HTML 5, English |
10036 | sfconservancy.org | 11691 | 5.05 | 200 | HTML 5, English |
10037 | opus4.kobv.de | 11692 | 5.05 | 200 | HTML 5 |
10038 | fancs.com | 11693 | 5.05 | 200 | HTML 5 |
10039 | bigoven.com | 11694 | 5.05 | 200 | HTML 5, English |
10040 | linuxfr.org | 11696 | 5.05 | 200 | HTML 5 |
10041 | thepinknews.com | 11697 | 5.05 | 200 | HTML 5, English |
10042 | mith.umd.edu | 11698 | 5.05 | 200 | HTML 5, English |
10043 | sos.state.mn.us | 11699 | 5.05 | 200 | HTML 5, English |
10044 | pge.com | 11700 | 5.05 | 200 | HTML 5, English |
10045 | collaboraoffice.com | 11701 | 5.05 | 200 | HTML 5, English |
10046 | processing.org | 11702 | 5.05 | 200 | HTML 5, English |
10047 | mason.gmu.edu | 11703 | 5.05 | 200 | No Lang |
10048 | boinc.berkeley.edu | 11704 | 5.05 | 200 | HTML 5, English |
10049 | newsbomb.gr | 11705 | 5.05 | 200 | HTML 5 |
10050 | cta.tech | 11708 | 5.05 | 200 | HTML 5, No Lang |
10051 | kitabisa.com | 11709 | 5.05 | 200 | HTML 5 |
10052 | docs.monei.com | 11710 | 5.05 | 200 | HTML 5, English |
10053 | penguin.com | 11711 | 5.05 | 200 | HTML 5, English |
10054 | lupus.org | 11713 | 5.05 | 200 | HTML 5, English |
10055 | phnompenhpost.com | 11714 | 5.05 | 200 | HTML 5, English |
10056 | princess.com | 11715 | 5.05 | 200 | HTML 5, English |
10057 | sesameworkshop.org | 11716 | 5.05 | 200 | HTML 5, English |
10058 | upguard.com | 11717 | 5.05 | 200 | HTML 5, English |
10059 | matalan.co.uk | 11718 | 5.05 | 200 | HTML 5, English |
10060 | churchtimes.co.uk | 11719 | 5.05 | 200 | HTML 5, No Lang |
10061 | bootcamp.uxdesign.cc | 11720 | 5.05 | 200 | HTML 5, English |
10062 | getodk.org | 11722 | 5.05 | 200 | HTML 5, English |
10063 | nasonline.org | 11723 | 5.05 | 200 | HTML 5, English |
10064 | go.dev | 11725 | 5.05 | 200 | HTML 5, English |
10065 | 3ds.com | 11726 | 5.05 | 200 | HTML 5, English |
10066 | mamba.ru | 11727 | 5.05 | 200 | HTML 5, English |
10067 | kit.com | 11728 | 5.05 | 200 | HTML 5, English |
10068 | sr.ht | 11729 | 5.05 | 200 | HTML 5, English |
10069 | oatly.com | 11731 | 5.05 | 200 | HTML 5, English |
10070 | globalreporting.org | 11732 | 5.05 | 200 | HTML 5, English |
10071 | alleghenycounty.us | 11734 | 5.05 | 200 | HTML 5, English |
10072 | farm4.staticflickr.com | 11735 | 5.05 | 200 | No Lang |
10073 | googlepublicpolicy.blogspot.com | 11736 | 5.05 | 200 | HTML 5, English |
10074 | la-croix.com | 11737 | 5.05 | 200 | HTML 5 |
10075 | currencylayer.com | 11738 | 5.05 | 200 | HTML 5, No Lang |
10076 | 01.org | 11739 | 5.05 | 200 | HTML 5, English |
10077 | ip2location.com | 11740 | 5.05 | 200 | HTML 5, English |
10078 | kjzz.org | 11741 | 5.05 | 200 | HTML 5, English |
10079 | ecowatch.com | 11742 | 5.05 | 200 | HTML 5, English |
10080 | swri.org | 11743 | 5.05 | 200 | HTML 5, English |
10081 | spur.us | 11744 | 5.05 | 200 | HTML 5, English |
10082 | restrictcontentpro.com | 11745 | 5.05 | 200 | HTML 5, English |
10083 | google.com.ph | 11746 | 5.05 | 200 | HTML 5, English |
10084 | wedmegood.com | 11748 | 5.05 | 200 | HTML 5, English |
10085 | findmespot.com | 11749 | 5.05 | 200 | HTML 5, English |
10086 | u.osu.edu | 11750 | 5.05 | 200 | HTML 5, English |
10087 | thorax.bmj.com | 11751 | 5.05 | 200 | HTML 5, English |
10088 | techsoup.org | 11752 | 5.05 | 200 | No Lang |
10089 | the-saleroom.com | 11753 | 5.05 | 200 | HTML 5, English |
10090 | uplus.co.kr | 11754 | 5.05 | 200 | HTML 5 |
10091 | mq.edu.au | 11755 | 5.05 | 200 | HTML 5, English |
10092 | elpais.com.co | 11756 | 5.05 | 200 | HTML 5 |
10093 | supporters.eff.org | 11757 | 5.05 | 200 | English |
10094 | practicaltypography.com | 11758 | 5.05 | 200 | HTML 5, English |
10095 | vegansociety.com | 11760 | 5.05 | 200 | HTML 5, English |
10096 | movie.douban.com | 11761 | 5.05 | 200 | HTML 5 |
10097 | drought.gov | 11762 | 5.05 | 200 | HTML 5, English |
10098 | ftw.usatoday.com | 11764 | 5.05 | 200 | HTML 5, English |
10099 | packagecontrol.io | 11765 | 5.04 | 200 | HTML 5, No Lang |
10100 | mybakingaddiction.com | 11766 | 5.04 | 200 | HTML 5, English |
Data from: Open PageRank