Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18801 | censusreporter.org | 21909 | 4.87 | 200 | HTML 5, English |
18802 | digital.hbs.edu | 21911 | 4.87 | 200 | HTML 5, English |
18803 | transjakarta.co.id | 21912 | 4.87 | 200 | HTML 5 |
18804 | nlihc.org | 21913 | 4.87 | 200 | HTML 5, English |
18805 | s3-ap-southeast-1.amazonaws.com | 21914 | 4.87 | 200 | HTML 5, English |
18806 | finsmes.com | 21915 | 4.87 | 200 | English |
18807 | stelladot.com | 21916 | 4.87 | 200 | HTML 5, English |
18808 | cdm.link | 21918 | 4.87 | 200 | HTML 5, English |
18809 | yosemite.epa.gov | 21919 | 4.87 | 200 | No Lang, Transitional |
18810 | shrsl.com | 21920 | 4.87 | 200 | No Lang |
18811 | knmi.nl | 21921 | 4.87 | 200 | HTML 5 |
18812 | inrs.fr | 21922 | 4.87 | 200 | HTML 5 |
18813 | lcdf.org | 21923 | 4.87 | 200 | HTML 5, No Lang |
18814 | eckankar.org | 21924 | 4.87 | 200 | HTML 5, English |
18815 | aprs.org | 21925 | 4.87 | 200 | No Lang |
18816 | powershellmagazine.com | 21927 | 4.87 | 200 | HTML 5, English |
18817 | beteve.cat | 21929 | 4.87 | 200 | HTML 5 |
18818 | faber.co.uk | 21931 | 4.87 | 200 | HTML 5, English |
18819 | pittsburghmagazine.com | 21933 | 4.87 | 200 | HTML 5, English |
18820 | mfat.govt.nz | 21934 | 4.87 | 200 | HTML 5, English |
18821 | larepublica.co | 21935 | 4.87 | 200 | HTML 5 |
18822 | compass.com | 21936 | 4.87 | 200 | HTML 5, English |
18823 | cornellsun.com | 21937 | 4.87 | 200 | HTML 5, English |
18824 | timee.co.jp | 21938 | 4.87 | 200 | HTML 5 |
18825 | app.uniswap.org | 21939 | 4.87 | 200 | HTML 5, No Lang |
18826 | therpf.com | 21940 | 4.87 | 200 | HTML 5, No Lang |
18827 | pro.sony | 21941 | 4.87 | 200 | HTML 5, English |
18828 | gardenia.net | 21943 | 4.87 | 200 | HTML 5, English |
18829 | bond.edu.au | 21944 | 4.87 | 200 | HTML 5, English |
18830 | bluestarfam.org | 21945 | 4.87 | 200 | HTML 5, English |
18831 | nea.gov.sg | 21946 | 4.87 | 200 | English |
18832 | chequeado.com | 21947 | 4.87 | 200 | HTML 5 |
18833 | we.riseup.net | 21949 | 4.87 | 200 | HTML 5, No Lang |
18834 | lefooding.com | 21950 | 4.87 | 200 | HTML 5 |
18835 | lullabot.com | 21952 | 4.87 | 200 | HTML 5, English |
18836 | law.indiana.edu | 21954 | 4.87 | 200 | HTML 5, English |
18837 | fox11online.com | 21955 | 4.87 | 200 | HTML 5, English |
18838 | whitecase.com | 21956 | 4.87 | 200 | HTML 5, English |
18839 | grammar.about.com | 21957 | 4.87 | 200 | HTML 5, English |
18840 | petronas.com | 21958 | 4.87 | 200 | HTML 5, English |
18841 | thegreatdiscontent.com | 21959 | 4.87 | 200 | HTML 5, English |
18842 | tandem.net | 21960 | 4.87 | 200 | HTML 5, English |
18843 | stacks.cdc.gov | 21961 | 4.87 | 200 | English |
18844 | mediatum.ub.tum.de | 21962 | 4.87 | 200 | No Lang, Transitional |
18845 | diamond.jp | 21963 | 4.87 | 200 | HTML 5 |
18846 | commercialcafe.com | 21964 | 4.87 | 200 | HTML 5, English |
18847 | oag.state.va.us | 21966 | 4.87 | 200 | HTML 5, English |
18848 | innocentive.com | 21968 | 4.87 | 200 | HTML 5, English |
18849 | lollapaloozade.com | 21969 | 4.87 | 200 | HTML 5 |
18850 | madonna.com | 21970 | 4.87 | 200 | HTML 5, English |
18851 | paulsmith.com | 21971 | 4.87 | 200 | HTML 5, English |
18852 | gcaptain.com | 21972 | 4.87 | 200 | HTML 5, English |
18853 | amightygirl.com | 21973 | 4.87 | 200 | HTML 5, English |
18854 | developer.bu.edu | 21974 | 4.87 | 200 | HTML 5, English |
18855 | kottayam.nic.in | 21975 | 4.87 | 200 | HTML 5 |
18856 | oem.bmj.com | 21976 | 4.87 | 200 | HTML 5, English |
18857 | katasztrofavedelem.hu | 21977 | 4.87 | 200 | HTML 5 |
18858 | incident57.com | 21979 | 4.87 | 200 | HTML 5, English |
18859 | gsgd.co.uk | 21981 | 4.87 | 200 | HTML 5, English |
18860 | biografiasyvidas.com | 21982 | 4.87 | 200 | HTML 5 |
18861 | clevelandhistorical.org | 21984 | 4.87 | 200 | HTML 5, English |
18862 | chs.harvard.edu | 21985 | 4.87 | 200 | HTML 5, English |
18863 | aaja.org | 21986 | 4.87 | 200 | HTML 5, English |
18864 | najdi.si | 21987 | 4.87 | 200 | HTML 5, No Lang |
18865 | hub.arcgis.com | 21989 | 4.87 | 200 | HTML 5, English |
18866 | history.house.gov | 21990 | 4.87 | 200 | HTML 5, English |
18867 | kvv.de | 21991 | 4.87 | 200 | HTML 5 |
18868 | ctbto.org | 21992 | 4.87 | 200 | HTML 5, No Lang |
18869 | news24online.com | 21993 | 4.87 | 200 | HTML 5, English |
18870 | lifenews.com | 21995 | 4.87 | 200 | HTML 5, English |
18871 | abcbirds.org | 21996 | 4.87 | 200 | HTML 5, English |
18872 | digi.com | 21998 | 4.87 | 200 | HTML 5, English |
18873 | wwpdb.org | 21999 | 4.87 | 200 | HTML 5, English |
18874 | vietnamtourism.gov.vn | 22000 | 4.87 | 200 | HTML 5 |
18875 | dataversity.net | 22001 | 4.87 | 200 | HTML 5, English |
18876 | uw.edu.pl | 22002 | 4.87 | 200 | HTML 5 |
18877 | scholar.google.co.jp | 22003 | 4.86 | 200 | HTML 5, No Lang |
18878 | parentmap.com | 22004 | 4.86 | 200 | HTML 5, English |
18879 | computersweden.idg.se | 22005 | 4.86 | 200 | HTML 5 |
18880 | bm.ge | 22006 | 4.86 | 200 | HTML 5 |
18881 | perso.telecom-paristech.fr | 22007 | 4.86 | 200 | HTML 5 |
18882 | heinz.cmu.edu | 22008 | 4.86 | 200 | HTML 5, English |
18883 | modelmayhem.com | 22009 | 4.86 | 200 | HTML 5, English |
18884 | lhotellerie-restauration.fr | 22010 | 4.86 | 200 | HTML 5 |
18885 | wordproject.org | 22011 | 4.86 | 200 | English |
18886 | bsse.ethz.ch | 22012 | 4.86 | 200 | HTML 5, English |
18887 | hcch.net | 22013 | 4.86 | 200 | HTML 5, English |
18888 | fwf.ac.at | 22014 | 4.86 | 200 | HTML 5 |
18889 | localbitcoins.com | 22015 | 4.86 | 200 | HTML 5, English |
18890 | sfaf.org | 22016 | 4.86 | 200 | HTML 5, English |
18891 | goodpods.com | 22017 | 4.86 | 200 | HTML 5, English |
18892 | api.yandex.ru | 22018 | 4.86 | 200 | HTML 5 |
18893 | eprints.nottingham.ac.uk | 22020 | 4.86 | 200 | HTML 5, English |
18894 | queerty.com | 22021 | 4.86 | 200 | HTML 5, English |
18895 | skoove.com | 22022 | 4.86 | 200 | HTML 5, English |
18896 | snapfiles.com | 22023 | 4.86 | 200 | English, Strict |
18897 | genshin.hoyoverse.com | 22024 | 4.86 | 200 | HTML 5, No Lang |
18898 | theartsdesk.com | 22025 | 4.86 | 200 | English |
18899 | mva.microsoft.com | 22026 | 4.86 | 200 | HTML 5, English |
18900 | scratchapixel.com | 22027 | 4.86 | 200 | HTML 5, English |
Data from: Open PageRank