Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
17001 | dsc.community.dev | 19815 | 4.88 | 200 | HTML 5, English |
17002 | ambient-mixer.com | 19816 | 4.88 | 200 | HTML 5, English |
17003 | nature.berkeley.edu | 19817 | 4.88 | 200 | HTML 5, English |
17004 | stats.nba.com | 19818 | 4.88 | 200 | HTML 5, English |
17005 | what-if.xkcd.com | 19819 | 4.88 | 200 | HTML 5, No Lang |
17006 | directory.libsyn.com | 19820 | 4.88 | 200 | HTML 5, English |
17007 | cs.arizona.edu | 19821 | 4.88 | 200 | HTML 5, English |
17008 | hawaiianairlines.com | 19822 | 4.88 | 200 | HTML 5, English |
17009 | accessconsciousness.com | 19823 | 4.88 | 200 | HTML 5, No Lang |
17010 | daac.ornl.gov | 19824 | 4.88 | 200 | HTML 5, English |
17011 | windley.com | 19827 | 4.88 | 200 | HTML 5, English |
17012 | searchcio.techtarget.com | 19828 | 4.88 | 200 | HTML 5, English |
17013 | snort.social | 19829 | 4.88 | 200 | HTML 5, English |
17014 | usask.ca | 19830 | 4.88 | 200 | HTML 5, English |
17015 | inhope.org | 19833 | 4.88 | 200 | HTML 5, No Lang |
17016 | zello.com | 19834 | 4.88 | 200 | HTML 5, English |
17017 | digital.bodleian.ox.ac.uk | 19835 | 4.88 | 200 | HTML 5, English |
17018 | bcbstx.com | 19836 | 4.88 | 200 | HTML 5, English |
17019 | socratic.org | 19837 | 4.88 | 200 | HTML 5, No Lang |
17020 | en.reset.org | 19838 | 4.88 | 200 | HTML 5, English |
17021 | content.naic.org | 19839 | 4.88 | 200 | HTML 5, English |
17022 | tritondigital.com | 19840 | 4.88 | 200 | HTML 5, English |
17023 | atap.google.com | 19841 | 4.88 | 200 | HTML 5, English |
17024 | woodcraft.com | 19842 | 4.88 | 200 | HTML 5, English |
17025 | historylink.org | 19843 | 4.88 | 200 | HTML 5, No Lang |
17026 | docs.nextcloud.com | 19844 | 4.88 | 200 | HTML 5, English |
17027 | spacejam.com | 19845 | 4.88 | 200 | HTML 5, English |
17028 | misinforeview.hks.harvard.edu | 19846 | 4.88 | 200 | HTML 5, English |
17029 | pioneerdj.com | 19847 | 4.88 | 200 | HTML 5, No Lang |
17030 | mamedev.org | 19848 | 4.88 | 200 | HTML 5, English |
17031 | blog.npmjs.org | 19849 | 4.88 | 200 | HTML 5, No Lang |
17032 | clym.io | 19850 | 4.88 | 200 | HTML 5, English |
17033 | mountainproject.com | 19852 | 4.88 | 200 | HTML 5, English |
17034 | geometrygames.org | 19853 | 4.88 | 200 | English, Strict |
17035 | blueimp.github.io | 19854 | 4.88 | 200 | HTML 5, English |
17036 | covid19.ca.gov | 19856 | 4.88 | 200 | HTML 5, English |
17037 | de-ch.wordpress.org | 19857 | 4.88 | 200 | HTML 5 |
17038 | qq.com | 19858 | 4.88 | 200 | HTML 5 |
17039 | uapress.arizona.edu | 19859 | 4.88 | 200 | HTML 5, English |
17040 | cuni.cz | 19860 | 4.88 | 200 | HTML 5 |
17041 | qunitjs.com | 19861 | 4.88 | 200 | HTML 5, English |
17042 | surniaulula.com | 19862 | 4.88 | 200 | HTML 5, English |
17043 | rega.ch | 19863 | 4.88 | 200 | HTML 5, English |
17044 | dos.fl.gov | 19864 | 4.88 | 200 | HTML 5, English |
17045 | sandiego.edu | 19865 | 4.88 | 200 | HTML 5, English |
17046 | vnf.fr | 19866 | 4.88 | 200 | HTML 5 |
17047 | barrys.com | 19867 | 4.88 | 200 | HTML 5, English |
17048 | panasonic.net | 19868 | 4.88 | 200 | HTML 5, English |
17049 | ctdbase.org | 19870 | 4.88 | 200 | HTML 5, English |
17050 | mercy.net | 19871 | 4.88 | 200 | HTML 5, English |
17051 | spinabifidaassociation.org | 19872 | 4.88 | 200 | HTML 5, English |
17052 | kronos.com | 19873 | 4.88 | 200 | HTML 5, English |
17053 | danielmiessler.com | 19874 | 4.88 | 200 | HTML 5, English |
17054 | wn.com | 19876 | 4.88 | 200 | HTML 5, English |
17055 | prdownloads.sourceforge.net | 19877 | 4.88 | 200 | HTML 5, English |
17056 | filmsite.org | 19878 | 4.88 | 200 | English, Strict |
17057 | simpleicons.org | 19879 | 4.88 | 200 | HTML 5, No Lang |
17058 | guesty.com | 19880 | 4.88 | 200 | HTML 5, English |
17059 | av.tib.eu | 19882 | 4.88 | 200 | HTML 5, English |
17060 | csounds.com | 19884 | 4.88 | 200 | No Lang |
17061 | wpgetpaid.com | 19885 | 4.88 | 200 | HTML 5, English |
17062 | guiarepsol.com | 19887 | 4.88 | 200 | HTML 5 |
17063 | yougov.com | 19888 | 4.88 | 200 | HTML 5, English |
17064 | dial.uclouvain.be | 19889 | 4.88 | 200 | No Lang |
17065 | localise.biz | 19890 | 4.88 | 200 | HTML 5, English |
17066 | docs.blender.org | 19892 | 4.88 | 200 | HTML 5, English |
17067 | garnierusa.com | 19893 | 4.88 | 200 | HTML 5, No Lang |
17068 | runnersworld.de | 19894 | 4.88 | 200 | HTML 5 |
17069 | currency.wiki | 19895 | 4.88 | 200 | HTML 5, English |
17070 | justice.gov.za | 19896 | 4.88 | 200 | English |
17071 | utalk.com | 19897 | 4.88 | 200 | HTML 5, English |
17072 | thequint.com | 19898 | 4.88 | 200 | HTML 5, English |
17073 | bcci.tv | 19899 | 4.88 | 200 | No Lang |
17074 | amarchitrakatha.com | 19900 | 4.88 | 200 | HTML 5, English |
17075 | travail-emploi.gouv.fr | 19901 | 4.88 | 200 | HTML 5 |
17076 | eventbrite.es | 19902 | 4.88 | 200 | HTML 5, No Lang |
17077 | brownbook.net | 19903 | 4.88 | 200 | HTML 5, English |
17078 | benlcollins.com | 19904 | 4.88 | 200 | HTML 5, English |
17079 | tecmundo.com.br | 19905 | 4.88 | 200 | HTML 5 |
17080 | wsd.gov.hk | 19907 | 4.88 | 200 | HTML 5, English |
17081 | maxhealthcare.in | 19908 | 4.88 | 200 | HTML 5, English |
17082 | ennaharonline.com | 19909 | 4.88 | 200 | HTML 5 |
17083 | nettavisen.no | 19910 | 4.88 | 200 | HTML 5 |
17084 | auphonic.com | 19911 | 4.88 | 200 | HTML 5, English |
17085 | journals.ku.edu | 19912 | 4.88 | 200 | HTML 5, English |
17086 | webglreport.com | 19913 | 4.88 | 200 | English |
17087 | datasetsearch.research.google.com | 19914 | 4.88 | 200 | HTML 5, English |
17088 | firebirdsql.org | 19915 | 4.88 | 200 | HTML 5, English |
17089 | academicworks.cuny.edu | 19916 | 4.88 | 200 | HTML 5, English |
17090 | nmsassistant.com | 19918 | 4.88 | 200 | HTML 5, English |
17091 | mom.me | 19919 | 4.88 | 200 | HTML 5, English |
17092 | rtings.com | 19920 | 4.88 | 200 | HTML 5, English |
17093 | acquire.io | 19922 | 4.88 | 200 | HTML 5, English |
17094 | epw.senate.gov | 19923 | 4.88 | 200 | HTML 5, English |
17095 | dbeaver.io | 19924 | 4.88 | 200 | HTML 5, English |
17096 | bioinformatics.org | 19925 | 4.88 | 200 | HTML 5, English |
17097 | cookaround.com | 19926 | 4.88 | 200 | HTML 5 |
17098 | huffingtonpost.it | 19927 | 4.88 | 200 | HTML 5 |
17099 | twobrothersindiashop.com | 19928 | 4.88 | 200 | HTML 5, English |
17100 | stmuv.bayern.de | 19930 | 4.88 | 200 |
Data from: Open PageRank