Table populated from the top 10M most popular websites.
- List Quality
- This list is prioritized by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Doesn't get much traffic, but not even in the top 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, which should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix, or park the domain to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
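
The doctype and lang observations above can be checked with a few lines of Python. This is only a minimal sketch, not the crawler actually used for this report: the function names (`classify_doctype`, `html_lang`, `page_flags`) are hypothetical, and the mapping to flag names like "HTML 5", "Strict", "Transitional", "English" and "No Lang" is an assumption inferred from the Flags column in the table below.

```python
import re
from html.parser import HTMLParser
from typing import List, Optional

# Matches a doctype declaration near the top of the page.
_DOCTYPE_RE = re.compile(r"<!DOCTYPE\s+([^>]*)>", re.IGNORECASE)


def classify_doctype(html: str) -> str:
    """Return 'HTML 5', 'Strict', 'Transitional', 'Frameset', 'Other' or 'None'."""
    match = _DOCTYPE_RE.search(html[:2048])
    if match is None:
        return "None"
    # Normalize whitespace and case before comparing.
    body = " ".join(match.group(1).split()).lower()
    if body == "html" or "about:legacy-compat" in body:
        return "HTML 5"
    for name in ("Strict", "Transitional", "Frameset"):
        if name.lower() in body:
            return name
    return "Other"


class _LangParser(HTMLParser):
    """Records the lang attribute of the first <html> start tag."""

    def __init__(self) -> None:
        super().__init__()
        self.lang: Optional[str] = None
        self.seen_html = False

    def handle_starttag(self, tag, attrs):
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


def html_lang(html: str) -> Optional[str]:
    """Return the <html lang="..."> value, or None if missing or empty."""
    parser = _LangParser()
    parser.feed(html)
    lang = (parser.lang or "").strip()
    return lang or None


def page_flags(html: str) -> List[str]:
    """Build a flag list; names and ordering here are assumptions."""
    flags = []
    doctype = classify_doctype(html)
    if doctype in ("HTML 5", "Strict", "Transitional"):
        flags.append(doctype)
    lang = html_lang(html)
    if lang is None:
        flags.append("No Lang")
    elif lang.lower().split("-")[0] == "en":
        flags.append("English")
    return flags
```

For example, `page_flags("<!DOCTYPE html><html lang='en'>")` returns `["HTML 5", "English"]`, and a page with no doctype and no lang attribute returns `["No Lang"]`. Fetching the page (and dealing with the naked-domain SSL errors noted above) is left out on purpose.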
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
11001 | financemagnates.com | 12822 | 5.01 | 200 | HTML 5, English |
11002 | streeteasy.com | 12823 | 5.01 | 200 | HTML 5, English |
11003 | 2009-2017.state.gov | 12824 | 5.01 | 200 | HTML 5, No Lang |
11004 | validator.ampproject.org | 12825 | 5.01 | 200 | HTML 5, No Lang |
11005 | dayoneapp.com | 12826 | 5.01 | 200 | HTML 5, English |
11006 | r-bloggers.com | 12827 | 5.01 | 200 | HTML 5, English |
11007 | dailynous.com | 12828 | 5.01 | 200 | HTML 5, English |
11008 | smittenkitchen.com | 12830 | 5.01 | 200 | HTML 5, English |
11009 | kika.de | 12831 | 5.01 | 200 | HTML 5, No Lang |
11010 | iledefrance-mobilites.fr | 12832 | 5.01 | 200 | HTML 5 |
11011 | madinamerica.com | 12833 | 5.01 | 200 | English |
11012 | hope.edu | 12834 | 5.01 | 200 | HTML 5, English |
11013 | uantwerpen.be | 12835 | 5.01 | 200 | HTML 5 |
11014 | api.jqueryui.com | 12836 | 5.01 | 200 | HTML 5, English |
11015 | mapmyrun.com | 12837 | 5.01 | 200 | HTML 5, English |
11016 | labpano.com | 12838 | 5.01 | 200 | HTML 5, No Lang |
11017 | groovypost.com | 12839 | 5.01 | 200 | HTML 5, English |
11018 | remita.net | 12840 | 5.01 | 200 | HTML 5, English |
11019 | freedom.press | 12841 | 5.01 | 200 | HTML 5, English |
11020 | vitacost.com | 12842 | 5.01 | 200 | HTML 5, English |
11021 | unherd.com | 12843 | 5.01 | 200 | HTML 5, English |
11022 | gotomeeting.com | 12844 | 5.01 | 200 | HTML 5, English |
11023 | macmillandictionary.com | 12845 | 5.01 | 200 | HTML 5, No Lang |
11024 | acl.gov | 12847 | 5.01 | 200 | HTML 5, English |
11025 | capmetro.org | 12848 | 5.01 | 200 | HTML 5, English |
11026 | bpmn.org | 12849 | 5.01 | 200 | HTML 5, No Lang |
11027 | emscripten.org | 12850 | 5.01 | 200 | HTML 5, English |
11028 | perseo.ec | 12851 | 5.01 | 200 | HTML 5 |
11029 | passepartout.net | 12852 | 5.01 | 200 | HTML 5 |
11030 | filamentgroup.com | 12853 | 5.01 | 200 | HTML 5, English |
11031 | litecoin.org | 12854 | 5.01 | 200 | HTML 5, English |
11032 | jedit.org | 12855 | 5.01 | 200 | No Lang, Transitional |
11033 | stuffandnonsense.co.uk | 12856 | 5.01 | 200 | HTML 5, English |
11034 | cyprus-mail.com | 12857 | 5.01 | 200 | HTML 5, English |
11035 | taito.co.jp | 12858 | 5.01 | 200 | HTML 5 |
11036 | icis.corp.delaware.gov | 12859 | 5.01 | 200 | No Lang, Strict |
11037 | veteranscrisisline.net | 12860 | 5.01 | 200 | HTML 5, English |
11038 | allmodern.com | 12861 | 5.01 | 200 | HTML 5, English |
11039 | ready2order.com | 12862 | 5.01 | 200 | HTML 5, English |
11040 | finder.com | 12864 | 5.01 | 200 | HTML 5, English |
11041 | beeradvocate.com | 12865 | 5.01 | 200 | HTML 5, English |
11042 | traderjoes.com | 12866 | 5.01 | 200 | HTML 5, English |
11043 | centrepompidou.fr | 12867 | 5.01 | 200 | HTML 5, English |
11044 | shohoz.com | 12869 | 5.01 | 200 | HTML 5, English |
11045 | consumerlab.com | 12870 | 5.01 | 200 | HTML 5, English |
11046 | egr.msu.edu | 12871 | 5.01 | 200 | HTML 5, English |
11047 | gzip.org | 12872 | 5.01 | 200 | HTML 5, English |
11048 | americanlibrariesmagazine.org | 12873 | 5.01 | 200 | HTML 5, English |
11049 | stan.com.au | 12875 | 5.01 | 200 | HTML 5, English |
11050 | wcvb.com | 12876 | 5.01 | 200 | HTML 5, English |
11051 | insight.kellogg.northwestern.edu | 12877 | 5.01 | 200 | HTML 5, English |
11052 | codesector.com | 12878 | 5.01 | 200 | HTML 5, English |
11053 | kit.co | 12879 | 5.01 | 200 | HTML 5, No Lang |
11054 | jalbum.net | 12881 | 5.01 | 200 | HTML 5, English |
11055 | www-archive.mozilla.org | 12882 | 5.01 | 200 | English, Strict |
11056 | red-dot.org | 12884 | 5.01 | 200 | HTML 5, English |
11057 | osu.ppy.sh | 12886 | 5.01 | 200 | HTML 5, English |
11058 | eenews.net | 12887 | 5.01 | 200 | HTML 5, English |
11059 | aphp.fr | 12888 | 5.01 | 200 | HTML 5 |
11060 | marvell.com | 12889 | 5.01 | 200 | HTML 5, English |
11061 | ipbes.net | 12890 | 5.01 | 200 | HTML 5, English |
11062 | randalls.com | 12891 | 5.01 | 200 | English |
11063 | jboss.org | 12892 | 5.01 | 200 | HTML 5, English |
11064 | todocoleccion.net | 12893 | 5.01 | 200 | HTML 5 |
11065 | cps.gov.uk | 12894 | 5.01 | 200 | HTML 5, English |
11066 | ukrstat.gov.ua | 12895 | 5.01 | 200 | No Lang |
11067 | news.trust.org | 12896 | 5.01 | 200 | HTML 5, No Lang |
11068 | iq.com | 12897 | 5.01 | 200 | HTML 5, English |
11069 | telmex.com | 12898 | 5.01 | 200 | HTML 5 |
11070 | frametagmedia.com.au | 12900 | 5.01 | 200 | HTML 5, English |
11071 | splashlearn.com | 12901 | 5.01 | 200 | HTML 5, No Lang |
11072 | mailmunch.com | 12902 | 5.01 | 200 | HTML 5, No Lang |
11073 | rmi.org | 12903 | 5.01 | 200 | HTML 5, English |
11074 | seranking.com | 12904 | 5.01 | 200 | HTML 5, English |
11075 | web.stagram.com | 12905 | 5.01 | 200 | HTML 5, English |
11076 | richiejp.com | 12906 | 5.01 | 200 | HTML 5, English |
11077 | trog.qgl.org | 12907 | 5.01 | 200 | HTML 5, English |
11078 | thecookierookie.com | 12908 | 5.01 | 200 | HTML 5, English |
11079 | bikeindex.org | 12909 | 5.01 | 200 | HTML 5, English |
11080 | ntsb.gov | 12910 | 5.01 | 200 | English, Strict |
11081 | squeak.org | 12911 | 5.01 | 200 | HTML 5, No Lang |
11082 | medtronicdiabetes.com | 12912 | 5.01 | 200 | English |
11083 | nonda.co | 12913 | 5.01 | 200 | HTML 5, English |
11084 | mail-tester.com | 12914 | 5.01 | 200 | English, Transitional |
11085 | supertuxkart.net | 12915 | 5.01 | 200 | HTML 5, English |
11086 | hashnode.com | 12916 | 5.01 | 200 | HTML 5, English |
11087 | support.sas.com | 12917 | 5.01 | 200 | HTML 5, English |
11088 | ceu.edu | 12918 | 5.01 | 200 | HTML 5, English |
11089 | bmwi.de | 12919 | 5.01 | 200 | HTML 5 |
11090 | marvelsnap.com | 12920 | 5.01 | 200 | HTML 5, No Lang |
11091 | wiki.c2.com | 12921 | 5.01 | 200 | No Lang |
11092 | borlabs.io | 12922 | 5.01 | 200 | HTML 5, English |
11093 | cosstores.com | 12923 | 5.01 | 200 | HTML 5, English |
11094 | response.restoration.noaa.gov | 12924 | 5.01 | 200 | HTML 5, English |
11095 | nagad.com.bd | 12925 | 5.01 | 200 | HTML 5, English |
11096 | globalsecurity.org | 12926 | 5.01 | 200 | English |
11097 | identityblog.com | 12927 | 5.01 | 200 | HTML 5, English |
11098 | fotocommunity.de | 12928 | 5.01 | 200 | HTML 5 |
11099 | posthaus.com.br | 12930 | 5.01 | 200 | HTML 5 |
11100 | buttondown.email | 12931 | 5.01 | 200 | HTML 5, English |
Data from: Open PageRank
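
For quick stats on the Flags column, the rows above can be tallied directly. A minimal sketch, assuming the rows are available as pipe-delimited text exactly as shown in the table; `tally_flags` is a hypothetical helper, not part of the report itself.

```python
from collections import Counter
from typing import Iterable


def tally_flags(rows: Iterable[str]) -> Counter:
    """Count each flag in the last column of pipe-delimited table rows."""
    counts: Counter = Counter()
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        # Skip the header, the separator line, and malformed rows.
        if len(cells) < 6 or not cells[0].isdigit():
            continue
        for flag in cells[5].split(","):
            flag = flag.strip()
            if flag:
                counts[flag] += 1
    return counts


# Three rows copied from the table above, plus header and separator.
sample = [
    "# | Domain | Sort | Rank | Status | Flags |",
    "---|---|---|---|---|---|",
    "11001 | financemagnates.com | 12822 | 5.01 | 200 | HTML 5, English |",
    "11010 | iledefrance-mobilites.fr | 12832 | 5.01 | 200 | HTML 5 |",
    "11032 | jedit.org | 12855 | 5.01 | 200 | No Lang, Transitional |",
]
counts = tally_flags(sample)
```

Running the same function over the full table would give the share of rows flagged "HTML 5" or "No Lang" for this slice of the list.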