Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
12801 | skatteverket.se | 14933 | 4.98 | 200 | HTML 5 |
12802 | nhm.org | 14934 | 4.98 | 200 | HTML 5, English |
12803 | picjumbo.com | 14935 | 4.98 | 200 | HTML 5, English |
12804 | mobileiron.com | 14936 | 4.98 | 200 | HTML 5, English |
12805 | aseelapp.com | 14937 | 4.98 | 200 | HTML 5, English |
12806 | kslegislature.org | 14938 | 4.98 | 200 | English, Transitional |
12807 | mistplay.com | 14939 | 4.98 | 200 | HTML 5, No Lang |
12808 | web.ics.purdue.edu | 14940 | 4.98 | 200 | No Lang |
12809 | itbrief.com.au | 14941 | 4.97 | 200 | HTML 5, English |
12810 | fd.nl | 14943 | 4.97 | 200 | HTML 5 |
12811 | nexttv.com | 14944 | 4.97 | 200 | HTML 5, English |
12812 | diariosur.es | 14945 | 4.97 | 200 | HTML 5 |
12813 | ahdictionary.com | 14946 | 4.97 | 200 | No Lang |
12814 | sling.com | 14947 | 4.97 | 200 | HTML 5, English |
12815 | trakt.tv | 14948 | 4.97 | 200 | HTML 5, No Lang |
12816 | knime.com | 14949 | 4.97 | 200 | HTML 5, English |
12817 | einpresswire.com | 14950 | 4.97 | 200 | HTML 5, English |
12818 | idangero.us | 14951 | 4.97 | 200 | HTML 5, No Lang |
12819 | sigmalive.com | 14952 | 4.97 | 200 | HTML 5, No Lang |
12820 | 19january2017snapshot.epa.gov | 14954 | 4.97 | 200 | English |
12821 | digitalpreservation.gov | 14955 | 4.97 | 200 | HTML 5, No Lang |
12822 | vcu.edu | 14956 | 4.97 | 200 | HTML 5, English |
12823 | secure.cardcom.solutions | 14957 | 4.97 | 200 | HTML 5, No Lang |
12824 | therepairmanual.com | 14958 | 4.97 | 200 | HTML 5, English |
12825 | it-recht-kanzlei.de | 14959 | 4.97 | 200 | HTML 5 |
12826 | app.pipefy.com | 14960 | 4.97 | 200 | HTML 5, English |
12827 | share.upmc.com | 14961 | 4.97 | 200 | HTML 5, English |
12828 | glympse.com | 14963 | 4.97 | 200 | HTML 5, English |
12829 | francetelevisions.fr | 14964 | 4.97 | 200 | HTML 5 |
12830 | dharmann.com | 14965 | 4.97 | 200 | HTML 5, English |
12831 | mts.by | 14967 | 4.97 | 200 | HTML 5 |
12832 | namecoin.org | 14968 | 4.97 | 200 | HTML 5, No Lang |
12833 | news.ifeng.com | 14969 | 4.97 | 200 | HTML 5 |
12834 | browsersync.io | 14971 | 4.97 | 200 | HTML 5, English |
12835 | www2.nhk.or.jp | 14972 | 4.97 | 200 | HTML 5 |
12836 | zacks.com | 14973 | 4.97 | 200 | HTML 5, English |
12837 | sagaftra.org | 14974 | 4.97 | 200 | HTML 5, English |
12838 | corporate.zalando.com | 14975 | 4.97 | 200 | HTML 5, English |
12839 | wienerlinien.at | 14976 | 4.97 | 200 | HTML 5 |
12840 | mapsplatform.google.com | 14977 | 4.97 | 200 | HTML 5, English |
12841 | paymentsdive.com | 14978 | 4.97 | 200 | HTML 5, English |
12842 | common-lisp.net | 14980 | 4.97 | 200 | HTML 5, English |
12843 | luzern.com | 14981 | 4.97 | 200 | HTML 5, English |
12844 | tinymce.com | 14982 | 4.97 | 200 | HTML 5, English |
12845 | ubu.com | 14983 | 4.97 | 200 | No Lang, Transitional |
12846 | tspace.library.utoronto.ca | 14984 | 4.97 | 200 | HTML 5, English |
12847 | stockholm.se | 14985 | 4.97 | 200 | HTML 5 |
12848 | rambus.com | 14986 | 4.97 | 200 | HTML 5, English |
12849 | breastcancer.org | 14987 | 4.97 | 200 | HTML 5, English |
12850 | host.madison.com | 14988 | 4.97 | 200 | HTML 5, English |
12851 | cakeresume.com | 14989 | 4.97 | 200 | HTML 5, English |
12852 | translate.google.de | 14990 | 4.97 | 200 | HTML 5 |
12853 | petsymposium.org | 14991 | 4.97 | 200 | No Lang, Strict |
12854 | jinja.palletsprojects.com | 14992 | 4.97 | 200 | HTML 5, English |
12855 | hillsong.com | 14993 | 4.97 | 200 | HTML 5, English |
12856 | garymarcus.substack.com | 14994 | 4.97 | 200 | HTML 5, English |
12857 | hanser-literaturverlage.de | 14996 | 4.97 | 200 | HTML 5 |
12858 | athleta.gap.com | 14997 | 4.97 | 200 | HTML 5, English |
12859 | rhino3d.com | 14998 | 4.97 | 200 | HTML 5, English |
12860 | accuradio.com | 14999 | 4.97 | 200 | HTML 5, No Lang |
12861 | marco.org | 15000 | 4.97 | 200 | HTML 5, English |
12862 | 2kgames.com | 15001 | 4.97 | 200 | HTML 5, No Lang |
12863 | adrecord.com | 15002 | 4.97 | 200 | HTML 5, English |
12864 | superoffice.com | 15003 | 4.97 | 200 | HTML 5, English |
12865 | jegtheme.com | 15004 | 4.97 | 200 | No Lang |
12866 | fr.pinterest.com | 15005 | 4.97 | 200 | HTML 5, English |
12867 | arup.com | 15006 | 4.97 | 200 | HTML 5, English |
12868 | foobar2000.org | 15007 | 4.97 | 200 | English, Strict |
12869 | arstechnica.co.uk | 15008 | 4.97 | 200 | HTML 5, English |
12870 | dre.pt | 15009 | 4.97 | 200 | HTML 5, No Lang |
12871 | app.livestorm.co | 15010 | 4.97 | 200 | HTML 5, No Lang |
12872 | phonearena.com | 15012 | 4.97 | 200 | HTML 5, English |
12873 | austinkleon.com | 15013 | 4.97 | 200 | HTML 5, English |
12874 | collectionscanada.gc.ca | 15014 | 4.97 | 200 | HTML 5 |
12875 | cpl.thalesgroup.com | 15015 | 4.97 | 200 | HTML 5, English |
12876 | michiganradio.org | 15016 | 4.97 | 200 | HTML 5, English |
12877 | weboutsourcing-gateway.com | 15017 | 4.97 | 200 | HTML 5, No Lang |
12878 | bugzilla.org | 15018 | 4.97 | 200 | HTML 5, English |
12879 | historyextra.com | 15019 | 4.97 | 200 | HTML 5, English |
12880 | errenskitchen.com | 15020 | 4.97 | 200 | HTML 5, English |
12881 | flywire.com | 15021 | 4.97 | 200 | HTML 5, English |
12882 | qmee.com | 15022 | 4.97 | 200 | HTML 5, English |
12883 | sleeknote.com | 15023 | 4.97 | 200 | HTML 5, English |
12884 | gasbuddy.com | 15024 | 4.97 | 200 | HTML 5, English |
12885 | uxmovement.com | 15025 | 4.97 | 200 | HTML 5, English |
12886 | voeazul.com.br | 15026 | 4.97 | 200 | HTML 5, English |
12887 | ukcop26.org | 15027 | 4.97 | 200 | HTML 5, No Lang |
12888 | news.bloombergtax.com | 15028 | 4.97 | 200 | HTML 5, English |
12889 | cfs.gov.hk | 15029 | 4.97 | 200 | No Lang, Transitional |
12890 | setosa.io | 15031 | 4.97 | 200 | HTML 5, English |
12891 | mediaratingcouncil.org | 15032 | 4.97 | 200 | HTML 5, English |
12892 | chanzuckerberg.com | 15033 | 4.97 | 200 | HTML 5, English |
12893 | govexec.com | 15034 | 4.97 | 200 | HTML 5, English |
12894 | wfdeaf.org | 15035 | 4.97 | 200 | HTML 5, English |
12895 | legis.state.pa.us | 15036 | 4.97 | 200 | HTML 5, English |
12896 | aicpa-cima.com | 15037 | 4.97 | 200 | HTML 5, English |
12897 | primaonline.it | 15038 | 4.97 | 200 | HTML 5 |
12898 | icu-project.org | 15039 | 4.97 | 200 | No Lang |
12899 | hookedonhouses.net | 15040 | 4.97 | 200 | HTML 5, English |
12900 | users.encs.concordia.ca | 15041 | 4.97 | 200 | No Lang |
Data from: Open PageRank