Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
4001 | weizmann.ac.il | 4762 | 5.38 | 200 | HTML 5, English |
4002 | coronavirus.jhu.edu | 4764 | 5.38 | 200 | HTML 5, English |
4003 | stjude.org | 4765 | 5.38 | 200 | English, Strict |
4004 | stern.de | 4766 | 5.38 | 200 | HTML 5 |
4005 | eventbrite.com.au | 4767 | 5.38 | 200 | HTML 5, No Lang |
4006 | worldvision.org | 4768 | 5.38 | 200 | HTML 5, English |
4007 | marketingweek.com | 4769 | 5.38 | 200 | HTML 5, English |
4008 | viz.com | 4770 | 5.38 | 200 | HTML 5, English |
4009 | ru.nl | 4772 | 5.38 | 200 | HTML 5 |
4010 | powr.io | 4773 | 5.38 | 200 | HTML 5, English |
4011 | lexology.com | 4775 | 5.38 | 200 | HTML 5, English |
4012 | nmap.org | 4776 | 5.38 | 200 | HTML 5, English |
4013 | pinterest.ru | 4778 | 5.38 | 200 | HTML 5, English |
4014 | ctvnews.ca | 4780 | 5.38 | 200 | HTML 5, English |
4015 | ct.de | 4781 | 5.38 | 200 | HTML 5 |
4016 | wpml.org | 4782 | 5.38 | 200 | HTML 5, English |
4017 | ny.curbed.com | 4783 | 5.38 | 200 | HTML 5, English |
4018 | customer.io | 4784 | 5.38 | 200 | HTML 5, English |
4019 | zenn.dev | 4785 | 5.38 | 200 | HTML 5 |
4020 | caltech.edu | 4786 | 5.38 | 200 | HTML 5, English |
4021 | sandals.com | 4787 | 5.38 | 200 | HTML 5, English |
4022 | umich.edu | 4788 | 5.38 | 200 | HTML 5, English |
4023 | earthdata.nasa.gov | 4789 | 5.38 | 200 | HTML 5, English |
4024 | rbc.ru | 4790 | 5.38 | 200 | HTML 5 |
4025 | lea.verou.me | 4791 | 5.38 | 200 | HTML 5, English |
4026 | mines.edu | 4794 | 5.38 | 200 | HTML 5, English |
4027 | datatables.net | 4795 | 5.38 | 200 | HTML 5, English |
4028 | laws-lois.justice.gc.ca | 4796 | 5.38 | 200 | HTML 5, English |
4029 | myrecipes.com | 4797 | 5.38 | 200 | HTML 5, English |
4030 | groovehq.com | 4798 | 5.38 | 200 | HTML 5, English |
4031 | tagesspiegel.de | 4799 | 5.38 | 200 | HTML 5 |
4032 | yourtango.com | 4800 | 5.38 | 200 | HTML 5, English |
4033 | civitatis.com | 4801 | 5.38 | 200 | HTML 5, English |
4034 | threadreaderapp.com | 4802 | 5.38 | 200 | HTML 5, No Lang |
4035 | consumer.huawei.com | 4803 | 5.38 | 200 | HTML 5, English |
4036 | news.umich.edu | 4805 | 5.38 | 200 | HTML 5, English |
4037 | migaweb.de | 4806 | 5.38 | 200 | HTML 5 |
4038 | archpaper.com | 4807 | 5.38 | 200 | HTML 5, English |
4039 | websiteplanet.com | 4810 | 5.38 | 200 | HTML 5, English |
4040 | aa.org | 4811 | 5.38 | 200 | HTML 5, English |
4041 | eatsmarter.de | 4812 | 5.38 | 200 | HTML 5 |
4042 | townhall.com | 4813 | 5.38 | 200 | HTML 5, English |
4043 | tencent.com | 4814 | 5.38 | 200 | English |
4044 | sketch.com | 4815 | 5.38 | 200 | HTML 5, English |
4045 | drugs.com | 4816 | 5.38 | 200 | HTML 5, English |
4046 | ed.gov | 4817 | 5.38 | 200 | HTML 5, English |
4047 | activision.com | 4818 | 5.38 | 200 | HTML 5, English |
4048 | sciencebasedmedicine.org | 4819 | 5.38 | 200 | HTML 5, English |
4049 | alz.org | 4820 | 5.38 | 200 | HTML 5, English |
4050 | bart.gov | 4821 | 5.38 | 200 | HTML 5, English |
4051 | poe.com | 4822 | 5.38 | 200 | HTML 5, English |
4052 | designtaxi.com | 4823 | 5.38 | 200 | HTML 5, No Lang |
4053 | benchmarkemail.com | 4824 | 5.38 | 200 | HTML 5, English |
4054 | view.genial.ly | 4825 | 5.38 | 200 | HTML 5, English |
4055 | arbeitsagentur.de | 4826 | 5.38 | 200 | HTML 5, No Lang |
4056 | panasonic.jp | 4827 | 5.38 | 200 | HTML 5 |
4057 | disneyland.disney.go.com | 4828 | 5.38 | 200 | HTML 5, English |
4058 | bath.ac.uk | 4829 | 5.38 | 200 | HTML 5, English |
4059 | derstandard.at | 4830 | 5.38 | 200 | HTML 5 |
4060 | icao.int | 4831 | 5.38 | 200 | English |
4061 | siteinspire.com | 4832 | 5.38 | 200 | HTML 5, English |
4062 | help.doordash.com | 4833 | 5.38 | 200 | HTML 5, English |
4063 | 20minutes.fr | 4834 | 5.38 | 200 | HTML 5 |
4064 | sverigesradio.se | 4835 | 5.38 | 200 | HTML 5 |
4065 | imagebam.com | 4836 | 5.38 | 200 | HTML 5, English |
4066 | livejournal.com | 4837 | 5.38 | 200 | HTML 5, English |
4067 | scholar.google.co.uk | 4838 | 5.38 | 200 | HTML 5, No Lang |
4068 | tomtom.com | 4839 | 5.37 | 200 | HTML 5, No Lang |
4069 | maxon.net | 4840 | 5.37 | 200 | HTML 5, English |
4070 | anker.com | 4841 | 5.37 | 200 | HTML 5, English |
4071 | ar.wikipedia.org | 4842 | 5.37 | 200 | HTML 5, No Lang |
4072 | internetlivestats.com | 4843 | 5.37 | 200 | HTML 5, English |
4073 | ofcom.org.uk | 4844 | 5.37 | 200 | HTML 5, English |
4074 | imageoptim.com | 4845 | 5.37 | 200 | HTML 5, English |
4075 | opinionator.blogs.nytimes.com | 4847 | 5.37 | 200 | HTML 5, English |
4076 | wendys.com | 4848 | 5.37 | 200 | HTML 5, English |
4077 | econstor.eu | 4849 | 5.37 | 200 | HTML 5, English |
4078 | news.bitcoin.com | 4850 | 5.37 | 200 | HTML 5, No Lang |
4079 | keepass.info | 4852 | 5.37 | 200 | HTML 5, English |
4080 | bugs.php.net | 4853 | 5.37 | 200 | HTML 5, English |
4081 | politiken.dk | 4854 | 5.37 | 200 | HTML 5 |
4082 | profiles.wordpress.org | 4855 | 5.37 | 200 | HTML 5, English |
4083 | globalforestwatch.org | 4856 | 5.37 | 200 | HTML 5, English |
4084 | dnainfo.com | 4857 | 5.37 | 200 | HTML 5, English |
4085 | theme-junkie.com | 4858 | 5.37 | 200 | HTML 5, English |
4086 | rug.nl | 4859 | 5.37 | 200 | HTML 5, English |
4087 | gimletmedia.com | 4860 | 5.37 | 200 | HTML 5, English |
4088 | ring.com | 4862 | 5.37 | 200 | HTML 5, English |
4089 | starwalk.space | 4863 | 5.37 | 200 | HTML 5, English |
4090 | thetimes.com | 4864 | 5.37 | 200 | HTML 5, English |
4091 | blog.archive.org | 4865 | 5.37 | 200 | HTML 5, English |
4092 | bedbathandbeyond.com | 4866 | 5.37 | 200 | HTML 5, English |
4093 | glossier.com | 4867 | 5.37 | 200 | HTML 5, English |
4094 | nrk.no | 4868 | 5.37 | 200 | HTML 5 |
4095 | reports.weforum.org | 4869 | 5.37 | 200 | HTML 5, English |
4096 | ine.es | 4870 | 5.37 | 200 | HTML 5 |
4097 | rss.com | 4871 | 5.37 | 200 | HTML 5, English |
4098 | juniperresearch.com | 4872 | 5.37 | 200 | No Lang |
4099 | ahrq.gov | 4873 | 5.37 | 200 | HTML 5, English |
4100 | devowl.io | 4874 | 5.37 | 200 | HTML 5, English |
Data from: Open PageRank