Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
6701 | wired.it | 7855 | 5.18 | 200 | HTML 5 |
6702 | money.com | 7856 | 5.18 | 200 | HTML 5, No Lang |
6703 | species.wikimedia.org | 7857 | 5.18 | 200 | HTML 5, No Lang |
6704 | fr.wiktionary.org | 7858 | 5.18 | 200 | HTML 5, No Lang |
6705 | pharmaceutical-journal.com | 7859 | 5.18 | 200 | HTML 5, English |
6706 | lumalabs.ai | 7860 | 5.18 | 200 | HTML 5, English |
6707 | polyu.edu.hk | 7861 | 5.18 | 200 | HTML 5, No Lang |
6708 | newschallenge.org | 7862 | 5.18 | 200 | HTML 5, English |
6709 | summitlighthouse.org | 7863 | 5.18 | 200 | HTML 5, English |
6710 | nefisyemektarifleri.com | 7864 | 5.18 | 200 | HTML 5 |
6711 | ara.cat | 7866 | 5.18 | 200 | HTML 5 |
6712 | savethechildren.org | 7867 | 5.18 | 200 | HTML 5, English |
6713 | nst.com.my | 7868 | 5.18 | 200 | HTML 5, English |
6714 | onextrapixel.com | 7869 | 5.18 | 200 | HTML 5, English |
6715 | thenorthernecho.co.uk | 7870 | 5.18 | 200 | HTML 5, English |
6716 | bandlab.com | 7871 | 5.18 | 200 | HTML 5, English |
6717 | jbs.cam.ac.uk | 7872 | 5.18 | 200 | HTML 5, English |
6718 | wigle.net | 7873 | 5.18 | 200 | HTML 5, English |
6719 | womenshealthmag.com | 7874 | 5.18 | 200 | HTML 5, English |
6720 | opb.org | 7876 | 5.18 | 200 | HTML 5, English |
6721 | pmc.com | 7877 | 5.18 | 200 | HTML 5, No Lang |
6722 | canon-europe.com | 7879 | 5.18 | 200 | HTML 5, No Lang |
6723 | michaelkors.com | 7880 | 5.18 | 200 | HTML 5, English |
6724 | users.ox.ac.uk | 7881 | 5.18 | 200 | HTML 5, English |
6725 | rbcroyalbank.com | 7882 | 5.18 | 200 | No Lang, Transitional |
6726 | haskell.org | 7883 | 5.18 | 200 | HTML 5, English |
6727 | earthlink.net | 7884 | 5.18 | 200 | HTML 5, English |
6728 | sourcefabric.org | 7885 | 5.18 | 200 | HTML 5, English |
6729 | itftennis.com | 7886 | 5.18 | 200 | No Lang |
6730 | harpers.org | 7887 | 5.18 | 200 | HTML 5, English |
6731 | randomhouse.de | 7888 | 5.18 | 200 | HTML 5 |
6732 | traveloregon.com | 7889 | 5.18 | 200 | HTML 5, English |
6733 | georgetown.edu | 7890 | 5.18 | 200 | HTML 5, English |
6734 | iconscout.com | 7891 | 5.18 | 200 | HTML 5, English |
6735 | volcano.si.edu | 7892 | 5.18 | 200 | English |
6736 | livre.fnac.com | 7893 | 5.18 | 200 | |
6737 | freedownloadmanager.org | 7894 | 5.18 | 200 | HTML 5, English |
6738 | talkspace.com | 7895 | 5.18 | 200 | HTML 5, English |
6739 | pw.edu.pl | 7896 | 5.18 | 200 | HTML 5 |
6740 | orlandosentinel.com | 7897 | 5.18 | 200 | HTML 5, English |
6741 | onlinebooks.library.upenn.edu | 7899 | 5.18 | 200 | HTML 5, English |
6742 | civicrm.org | 7900 | 5.18 | 200 | HTML 5, English |
6743 | bugzilla.redhat.com | 7901 | 5.18 | 200 | HTML 5, English |
6744 | s3.us-east-2.amazonaws.com | 7902 | 5.18 | 200 | HTML 5, English |
6745 | linux.die.net | 7903 | 5.18 | 200 | HTML 5, English |
6746 | is.muni.cz | 7904 | 5.18 | 200 | HTML 5 |
6747 | riotgames.com | 7905 | 5.18 | 200 | HTML 5, English |
6748 | fcps.edu | 7906 | 5.18 | 200 | HTML 5, English |
6749 | ab-inbev.com | 7907 | 5.18 | 200 | HTML 5, English |
6750 | flickriver.com | 7908 | 5.18 | 200 | No Lang, Transitional |
6751 | modpagespeed.com | 7909 | 5.18 | 200 | No Lang |
6752 | tv2.no | 7910 | 5.18 | 200 | HTML 5 |
6753 | complianz.io | 7911 | 5.18 | 200 | HTML 5, English |
6754 | customink.com | 7912 | 5.18 | 200 | HTML 5, English |
6755 | sos.oregon.gov | 7913 | 5.18 | 200 | HTML 5, English |
6756 | dagbladet.no | 7915 | 5.18 | 200 | HTML 5 |
6757 | hackaday.io | 7916 | 5.18 | 200 | HTML 5, English |
6758 | vtiger.com | 7917 | 5.18 | 200 | HTML 5, English |
6759 | snapcraft.io | 7920 | 5.18 | 200 | HTML 5, English |
6760 | readymag.com | 7921 | 5.18 | 200 | HTML 5, No Lang |
6761 | hs.fi | 7922 | 5.18 | 200 | HTML 5 |
6762 | travel.stackexchange.com | 7924 | 5.18 | 200 | HTML 5, English |
6763 | cdn.knightlab.com | 7925 | 5.18 | 200 | HTML 5, No Lang |
6764 | kixeye.com | 7926 | 5.18 | 200 | HTML 5, No Lang |
6765 | sos.wa.gov | 7927 | 5.18 | 200 | HTML 5, English |
6766 | easterseals.com | 7928 | 5.18 | 200 | HTML 5, English |
6767 | foundation.app | 7929 | 5.18 | 200 | HTML 5, English |
6768 | nissan-global.com | 7930 | 5.18 | 200 | HTML 5, English |
6769 | cc.gatech.edu | 7931 | 5.18 | 200 | HTML 5, English |
6770 | met.ie | 7932 | 5.18 | 200 | English |
6771 | thesocietypages.org | 7933 | 5.18 | 200 | HTML 5, English |
6772 | fox.com | 7934 | 5.18 | 200 | HTML 5, English |
6773 | spiceworks.com | 7935 | 5.18 | 200 | HTML 5, English |
6774 | amara.org | 7936 | 5.18 | 200 | HTML 5, No Lang |
6775 | mediamatters.org | 7937 | 5.18 | 200 | English |
6776 | zipcar.com | 7938 | 5.18 | 200 | HTML 5, English |
6777 | learn.wordpress.org | 7939 | 5.18 | 200 | HTML 5, English |
6778 | foodstandards.gov.au | 7940 | 5.18 | 200 | HTML 5, English |
6779 | jslint.com | 7941 | 5.18 | 200 | HTML 5, English |
6780 | workana.com | 7942 | 5.18 | 200 | HTML 5, English |
6781 | wisc.edu | 7943 | 5.18 | 200 | HTML 5, English |
6782 | kentucky.com | 7944 | 5.18 | 200 | HTML 5, English |
6783 | instituteforgovernment.org.uk | 7945 | 5.18 | 200 | HTML 5, English |
6784 | graphis.com | 7947 | 5.18 | 200 | HTML 5, English |
6785 | seedprod.com | 7948 | 5.18 | 200 | HTML 5, English |
6786 | kth.se | 7949 | 5.18 | 200 | HTML 5 |
6787 | law.lis.virginia.gov | 7950 | 5.18 | 200 | HTML 5, English |
6788 | oll.libertyfund.org | 7951 | 5.18 | 200 | HTML 5, No Lang |
6789 | ringcentral.com | 7952 | 5.18 | 200 | HTML 5, English |
6790 | lazada.vn | 7953 | 5.18 | 200 | HTML 5, No Lang |
6791 | code.msdn.microsoft.com | 7954 | 5.18 | 200 | HTML 5, English |
6792 | noz.de | 7955 | 5.18 | 200 | HTML 5 |
6793 | drugabuse.gov | 7956 | 5.18 | 200 | HTML 5, English |
6794 | rust-lang.org | 7958 | 5.18 | 200 | HTML 5, English |
6795 | healthcentral.com | 7959 | 5.18 | 200 | HTML 5, English |
6796 | dreamworksanimation.com | 7961 | 5.18 | 200 | HTML 5, English |
6797 | kit.edu | 7963 | 5.18 | 200 | HTML 5 |
6798 | hypothes.is | 7964 | 5.18 | 200 | HTML 5, English |
6799 | cs.unc.edu | 7965 | 5.18 | 200 | HTML 5, English |
6800 | dortmund.de | 7966 | 5.18 | 200 | HTML 5 |
Data from: Open PageRank