Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into Postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, valid HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The listed domains are naked, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com; the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
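The doctype and lang observations above map onto the Flags column in the table that follows. The crawler's actual rules are not shown in these notes, so the sketch below is only one plausible way to derive such flags from a fetched home page. The function name `classify`, the regular expressions, and the 4096-byte cutoff are all assumptions made for illustration.

```python
import re

# Assumed rules for illustration; the real crawler logic is not documented here.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# Match a bare lang attribute, not xml:lang, hreflang, or data-lang.
LANG_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return flags such as 'HTML 5', 'English', 'No Lang', 'Transitional', 'Strict'."""
    head = html[:4096]  # doctype and <html> normally appear near the top
    flags = []

    doctype = DOCTYPE_RE.search(head)
    decl = doctype.group(1).strip().lower() if doctype else ""
    if decl == "html":
        flags.append("HTML 5")

    tag = HTML_TAG_RE.search(head)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().startswith("en"):
        flags.append("English")

    # Legacy doctypes are identified by their public or system identifier.
    if "transitional" in decl:
        flags.append("Transitional")
    elif "strict" in decl:
        flags.append("Strict")
    return flags
```

Under these assumed rules, a page with an HTML 5 doctype and a non-English lang attribute gets only the `HTML 5` flag, which would be consistent with rows in the table that show `HTML 5` alone.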
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16901 | wellnessliving.com | 19694 | 4.90 | 200 | HTML 5, English |
16902 | notiz.blog | 19695 | 4.90 | 200 | HTML 5 |
16903 | hackteria.org | 19696 | 4.90 | 200 | English, Transitional |
16904 | avantlink.com | 19697 | 4.90 | 200 | HTML 5, English |
16905 | au.kddi.com | 19698 | 4.90 | 200 | HTML 5 |
16906 | fatsecret.com | 19699 | 4.90 | 200 | No Lang, Transitional |
16907 | acaai.org | 19700 | 4.90 | 200 | HTML 5, English |
16908 | travelperk.com | 19701 | 4.90 | 200 | HTML 5, English |
16909 | pages.mtu.edu | 19702 | 4.90 | 200 | HTML 5, English |
16910 | nationalgrid.com | 19704 | 4.90 | 200 | HTML 5, English |
16911 | hikingproject.com | 19705 | 4.90 | 200 | HTML 5, English |
16912 | neighborly.com | 19706 | 4.88 | 200 | HTML 5, English |
16913 | kinobox.cz | 19707 | 4.88 | 200 | HTML 5 |
16914 | jfrog.com | 19708 | 4.88 | 200 | HTML 5, English |
16915 | sweetcsdesigns.com | 19709 | 4.88 | 200 | HTML 5, English |
16916 | fed.brid.gy | 19711 | 4.88 | 200 | HTML 5, No Lang |
16917 | lyngsat.com | 19712 | 4.88 | 200 | No Lang |
16918 | mrl.nyu.edu | 19713 | 4.88 | 200 | No Lang, Strict |
16919 | municipiodequeretaro.gob.mx | 19714 | 4.88 | 200 | HTML 5 |
16920 | emacswiki.org | 19715 | 4.88 | 200 | HTML 5, No Lang |
16921 | encoding.spec.whatwg.org | 19716 | 4.88 | 200 | HTML 5, English |
16922 | forms.app | 19717 | 4.88 | 200 | HTML 5, English |
16923 | zhaw.ch | 19718 | 4.88 | 200 | HTML 5 |
16924 | c2.synology.com | 19720 | 4.88 | 200 | HTML 5, English |
16925 | theacsi.org | 19722 | 4.88 | 200 | HTML 5, English |
16926 | ftp.ncbi.nih.gov | 19723 | 4.88 | 200 | No Lang |
16927 | skatteetaten.no | 19724 | 4.88 | 200 | HTML 5 |
16928 | cru.org | 19725 | 4.88 | 200 | HTML 5, English |
16929 | opennet.ru | 19726 | 4.88 | 200 | No Lang |
16930 | marketing-interactive.com | 19727 | 4.88 | 200 | HTML 5, English |
16931 | naco.org | 19728 | 4.88 | 200 | English |
16932 | economix.blogs.nytimes.com | 19729 | 4.88 | 200 | HTML 5, English |
16933 | blogs.uoregon.edu | 19730 | 4.88 | 200 | No Lang |
16934 | telford.gov.uk | 19731 | 4.88 | 200 | HTML 5, English |
16935 | huntr.dev | 19732 | 4.88 | 200 | HTML 5, English |
16936 | cw33.com | 19733 | 4.88 | 200 | HTML 5, English |
16937 | bostondynamics.com | 19734 | 4.88 | 200 | HTML 5, English |
16938 | sunexpress.com | 19735 | 4.88 | 200 | HTML 5, English |
16939 | israelhayom.com | 19736 | 4.88 | 200 | HTML 5, English |
16940 | fchampalimaud.org | 19737 | 4.88 | 200 | English |
16941 | gnc.com | 19738 | 4.88 | 200 | HTML 5, English |
16942 | patriots.com | 19740 | 4.88 | 200 | HTML 5, English |
16943 | tci-thaijo.org | 19741 | 4.88 | 200 | HTML 5, English |
16944 | portal.gov.cz | 19742 | 4.88 | 200 | HTML 5 |
16945 | stylight.com | 19743 | 4.88 | 200 | HTML 5, English |
16946 | ff.garena.com | 19745 | 4.88 | 200 | HTML 5, English |
16947 | air.mozilla.org | 19746 | 4.88 | 200 | English, Strict |
16948 | nativetech.org | 19747 | 4.88 | 200 | No Lang |
16949 | tijd.be | 19748 | 4.88 | 200 | HTML 5 |
16950 | factcheck.afp.com | 19750 | 4.88 | 200 | HTML 5, English |
16951 | dataguidance.com | 19751 | 4.88 | 200 | HTML 5, English |
16952 | repositorio-aberto.up.pt | 19752 | 4.88 | 200 | HTML 5, No Lang |
16953 | hahaha.com | 19753 | 4.88 | 200 | HTML 5 |
16954 | sigir.org | 19754 | 4.88 | 200 | HTML 5, English |
16955 | help.mikrotik.com | 19757 | 4.88 | 200 | HTML 5, English |
16956 | baochinhphu.vn | 19758 | 4.88 | 200 | HTML 5 |
16957 | ite.org | 19759 | 4.88 | 200 | HTML 5, English |
16958 | wesnoth.org | 19760 | 4.88 | 200 | HTML 5, English |
16959 | bremen.de | 19762 | 4.88 | 200 | HTML 5 |
16960 | wesleyan.edu | 19763 | 4.88 | 200 | HTML 5, English |
16961 | dermstore.com | 19764 | 4.88 | 200 | HTML 5, English |
16962 | eco-business.com | 19765 | 4.88 | 200 | HTML 5, English |
16963 | timedoctor.com | 19766 | 4.88 | 200 | HTML 5, No Lang |
16964 | tamm.abudhabi | 19767 | 4.88 | 200 | HTML 5, English |
16965 | ktm.com | 19769 | 4.88 | 200 | HTML 5, English |
16966 | joda-time.sourceforge.net | 19770 | 4.88 | 200 | English, Transitional |
16967 | thenewsminute.com | 19774 | 4.88 | 200 | HTML 5, English |
16968 | traffic.org | 19775 | 4.88 | 200 | HTML 5, English |
16969 | drugfree.org | 19776 | 4.88 | 200 | HTML 5, English |
16970 | bmel.de | 19780 | 4.88 | 200 | HTML 5, English |
16971 | tatacommunications.com | 19781 | 4.88 | 200 | HTML 5, English |
16972 | infograph.venngage.com | 19782 | 4.88 | 200 | HTML 5, English |
16973 | news.linkedin.com | 19783 | 4.88 | 200 | HTML 5, English |
16974 | campingandcaravanningclub.co.uk | 19784 | 4.88 | 200 | HTML 5, English |
16975 | groundhogg.io | 19785 | 4.88 | 200 | HTML 5, English |
16976 | mathnet.ru | 19786 | 4.88 | 200 | No Lang |
16977 | onoffmix.com | 19787 | 4.88 | 200 | HTML 5 |
16978 | nextias.com | 19788 | 4.88 | 200 | HTML 5, English |
16979 | math.hawaii.edu | 19789 | 4.88 | 200 | HTML 5, English |
16980 | atmos-chem-phys.net | 19790 | 4.88 | 200 | English, Transitional |
16981 | anime-planet.com | 19791 | 4.88 | 200 | HTML 5, English |
16982 | caravancampingsales.com.au | 19792 | 4.88 | 200 | HTML 5, English |
16983 | ncmd.co.uk | 19793 | 4.88 | 200 | HTML 5, English |
16984 | kotlinconf.com | 19794 | 4.88 | 200 | HTML 5, English |
16985 | artribune.com | 19795 | 4.88 | 200 | HTML 5 |
16986 | soulshepherding.org | 19796 | 4.88 | 200 | HTML 5, English |
16987 | wpamelia.com | 19797 | 4.88 | 200 | HTML 5, English |
16988 | kongsberg.com | 19798 | 4.88 | 200 | HTML 5, English |
16989 | gaming.stackexchange.com | 19800 | 4.88 | 200 | HTML 5, English |
16990 | worldfinance.com | 19802 | 4.88 | 200 | HTML 5, English |
16991 | nkn.org | 19803 | 4.88 | 200 | HTML 5, English |
16992 | cs.utoronto.ca | 19804 | 4.88 | 200 | HTML 5, English |
16993 | donaukurier.de | 19805 | 4.88 | 200 | HTML 5 |
16994 | cra-arc.gc.ca | 19806 | 4.88 | 200 | HTML 5, English |
16995 | aaaai.org | 19807 | 4.88 | 200 | HTML 5, English |
16996 | unicefkidpower.org | 19808 | 4.88 | 200 | HTML 5, No Lang |
16997 | datastax.com | 19809 | 4.88 | 200 | HTML 5, English |
16998 | mandai.com | 19810 | 4.88 | 200 | HTML 5, English |
16999 | cs.hmc.edu | 19812 | 4.88 | 200 | HTML 5, English |
17000 | raps.org | 19813 | 4.88 | 200 | HTML 5, English |
Data from: Open PageRank