Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
11801 | demos.org | 13761 | 5.00 | 200 | HTML 5, English |
11802 | meneame.net | 13762 | 5.00 | 200 | HTML 5 |
11803 | clickondetroit.com | 13764 | 5.00 | 200 | HTML 5, English |
11804 | askmen.com | 13765 | 5.00 | 200 | HTML 5, English |
11805 | jacobin.com | 13766 | 5.00 | 200 | HTML 5, English |
11806 | houseofdeeprelax.com | 13767 | 5.00 | 200 | HTML 5 |
11807 | farm5.static.flickr.com | 13768 | 5.00 | 200 | No Lang |
11808 | apps.npr.org | 13769 | 5.00 | 200 | HTML 5, English |
11809 | 13wmaz.com | 13770 | 5.00 | 200 | HTML 5, English |
11810 | adplugg.com | 13771 | 5.00 | 200 | HTML 5, English |
11811 | irssi.org | 13772 | 5.00 | 200 | HTML 5, English |
11812 | gnavi.co.jp | 13773 | 5.00 | 200 | HTML 5 |
11813 | e-junkie.com | 13774 | 5.00 | 200 | HTML 5, English |
11814 | rosettastone.com | 13776 | 5.00 | 200 | HTML 5, English |
11815 | medicine.uiowa.edu | 13777 | 5.00 | 200 | HTML 5, English |
11816 | onf.ca | 13778 | 5.00 | 200 | HTML 5 |
11817 | leap.se | 13779 | 5.00 | 200 | HTML 5, English |
11818 | jumpsport.com | 13782 | 5.00 | 200 | HTML 5, English |
11819 | goodleap.com | 13783 | 5.00 | 200 | HTML 5, English |
11820 | jc.ne10.uol.com.br | 13784 | 5.00 | 200 | HTML 5 |
11821 | dslreports.com | 13785 | 5.00 | 200 | No Lang, Transitional |
11822 | aira.io | 13787 | 5.00 | 200 | HTML 5, English |
11823 | thegazette.com | 13789 | 5.00 | 200 | HTML 5, English |
11824 | wsu.edu | 13790 | 5.00 | 200 | HTML 5, English |
11825 | brackets.io | 13791 | 5.00 | 200 | HTML 5, English |
11826 | taniarascia.com | 13792 | 5.00 | 200 | HTML 5, No Lang |
11827 | eurecom.fr | 13793 | 5.00 | 200 | HTML 5, English |
11828 | usef.org | 13794 | 5.00 | 200 | No Lang |
11829 | deliciousdays.com | 13795 | 5.00 | 200 | English, Transitional |
11830 | classicfm.com | 13796 | 5.00 | 200 | HTML 5, English |
11831 | monei.com | 13798 | 5.00 | 200 | HTML 5, English |
11832 | iso20022.org | 13800 | 5.00 | 200 | HTML 5, English |
11833 | palletsprojects.com | 13801 | 5.00 | 200 | HTML 5, English |
11834 | cli.re | 13802 | 5.00 | 200 | HTML 5, English |
11835 | trailhead.salesforce.com | 13803 | 5.00 | 200 | HTML 5, English |
11836 | ahealthiermichigan.org | 13805 | 5.00 | 200 | HTML 5, English |
11837 | wfmu.org | 13806 | 5.00 | 200 | HTML 5, No Lang |
11838 | imec-int.com | 13807 | 5.00 | 200 | HTML 5, English |
11839 | hmetro.com.my | 13809 | 5.00 | 200 | HTML 5, English |
11840 | mohfw.gov.in | 13810 | 5.00 | 200 | English |
11841 | jamstack.org | 13811 | 5.00 | 200 | HTML 5, English |
11842 | yottapay.co.uk | 13812 | 5.00 | 200 | HTML 5, English |
11843 | cointracker.io | 13814 | 5.00 | 200 | HTML 5, English |
11844 | science.house.gov | 13815 | 5.00 | 200 | HTML 5, English |
11845 | fora.tv | 13818 | 5.00 | 200 | HTML 5 |
11846 | retailwire.com | 13819 | 5.00 | 200 | HTML 5, English |
11847 | geeky-gadgets.com | 13821 | 5.00 | 200 | HTML 5, English |
11848 | thispersondoesnotexist.com | 13823 | 5.00 | 200 | No Lang |
11849 | ny.gov | 13824 | 5.00 | 200 | HTML 5, English |
11850 | hemnet.se | 13825 | 5.00 | 200 | HTML 5 |
11851 | historicengland.org.uk | 13827 | 5.00 | 200 | HTML 5, English |
11852 | sfstandard.com | 13828 | 5.00 | 200 | HTML 5, English |
11853 | observation.org | 13829 | 5.00 | 200 | HTML 5, English |
11854 | sentinelone.com | 13830 | 5.00 | 200 | HTML 5, English |
11855 | sdpnoticias.com | 13832 | 5.00 | 200 | HTML 5 |
11856 | recht.nrw.de | 13833 | 5.00 | 200 | HTML 5 |
11857 | carbonfootprint.com | 13835 | 5.00 | 200 | HTML 5, No Lang |
11858 | ceh.ac.uk | 13836 | 5.00 | 200 | HTML 5, English |
11859 | aleteia.org | 13837 | 5.00 | 200 | HTML 5, English |
11860 | fiveguys.com | 13838 | 5.00 | 200 | HTML 5, English |
11861 | live.com | 13839 | 5.00 | 200 | HTML 5, English |
11862 | mackie.com | 13840 | 5.00 | 200 | HTML 5, English |
11863 | 1.usa.gov | 13841 | 5.00 | 200 | HTML 5, English |
11864 | lysator.liu.se | 13842 | 5.00 | 200 | |
11865 | metopera.org | 13843 | 5.00 | 200 | HTML 5, English |
11866 | azuremagazine.com | 13844 | 5.00 | 200 | HTML 5, No Lang |
11867 | underscores.me | 13845 | 5.00 | 200 | HTML 5, English |
11868 | immaf.org | 13846 | 5.00 | 200 | HTML 5, English |
11869 | alfabank.ru | 13847 | 5.00 | 200 | HTML 5 |
11870 | beachbody.com | 13848 | 5.00 | 200 | HTML 5, English |
11871 | merlin.allaboutbirds.org | 13850 | 5.00 | 200 | HTML 5, English |
11872 | france.fr | 13851 | 5.00 | 200 | HTML 5, English |
11873 | procreate.art | 13852 | 5.00 | 200 | HTML 5, English |
11874 | share.getcloudapp.com | 13853 | 5.00 | 200 | HTML 5, English |
11875 | soundhound.com | 13854 | 5.00 | 200 | HTML 5, English |
11876 | skyvector.com | 13855 | 5.00 | 200 | HTML 5, No Lang |
11877 | smartrecruiters.com | 13856 | 5.00 | 200 | HTML 5, English |
11878 | publico.es | 13857 | 5.00 | 200 | HTML 5 |
11879 | laika.com | 13858 | 5.00 | 200 | HTML 5, English |
11880 | economia.uol.com.br | 13859 | 5.00 | 200 | HTML 5 |
11881 | ucsdnews.ucsd.edu | 13860 | 5.00 | 200 | HTML 5, English |
11882 | forums.iis.net | 13861 | 5.00 | 200 | HTML 5, English |
11883 | unboundmedicine.com | 13862 | 5.00 | 200 | HTML 5, English |
11884 | rabble.ca | 13864 | 5.00 | 200 | HTML 5, English |
11885 | ts.fi | 13865 | 5.00 | 200 | HTML 5 |
11886 | trutv.com | 13866 | 5.00 | 200 | HTML 5, English |
11887 | cure53.de | 13867 | 5.00 | 200 | HTML 5, English |
11888 | openwetware.org | 13868 | 5.00 | 200 | HTML 5, English |
11889 | missuniverse.com | 13869 | 5.00 | 200 | HTML 5, English |
11890 | faroutmagazine.co.uk | 13870 | 5.00 | 200 | HTML 5, English |
11891 | berkshireeagle.com | 13871 | 5.00 | 200 | HTML 5, English |
11892 | statssa.gov.za | 13872 | 5.00 | 200 | No Lang |
11893 | hannover.de | 13873 | 5.00 | 200 | HTML 5 |
11894 | ing.dk | 13874 | 5.00 | 200 | HTML 5 |
11895 | peakery.com | 13875 | 5.00 | 200 | English, Transitional |
11896 | leadertask.com | 13876 | 5.00 | 200 | HTML 5, English |
11897 | econlib.org | 13879 | 5.00 | 200 | HTML 5, English |
11898 | dallasobserver.com | 13881 | 5.00 | 200 | HTML 5, English |
11899 | help.zoho.com | 13882 | 5.00 | 200 | HTML 5, English |
11900 | minneapolisfed.org | 13884 | 5.00 | 200 | HTML 5, English |
Data from: Open PageRank