Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18901 | indexoncensorship.org | 22029 | 4.86 | 200 | HTML 5, English |
18902 | minepi.com | 22030 | 4.86 | 200 | HTML 5, English |
18903 | uk.sports.yahoo.com | 22031 | 4.86 | 200 | HTML 5, English |
18904 | tvi.iol.pt | 22032 | 4.86 | 200 | HTML 5 |
18905 | rolls-royce.com | 22033 | 4.86 | 200 | HTML 5, English |
18906 | tase.co.il | 22034 | 4.86 | 200 | HTML 5 |
18907 | docs.bigbluebutton.org | 22035 | 4.86 | 200 | HTML 5, English |
18908 | photopills.com | 22036 | 4.86 | 200 | HTML 5, English |
18909 | iftf.org | 22037 | 4.86 | 200 | HTML 5, English |
18910 | mrdoob.com | 22038 | 4.86 | 200 | HTML 5, English |
18911 | wate.com | 22039 | 4.86 | 200 | HTML 5, English |
18912 | openaire.eu | 22046 | 4.86 | 200 | HTML 5, English |
18913 | ufdcimages.uflib.ufl.edu | 22047 | 4.86 | 200 | No Lang |
18914 | allgaeuer-zeitung.de | 22048 | 4.86 | 200 | HTML 5 |
18915 | javascriptweekly.com | 22050 | 4.86 | 200 | HTML 5, English |
18916 | sorgalla.com | 22051 | 4.86 | 200 | HTML 5, No Lang |
18917 | voedingscentrum.nl | 22052 | 4.86 | 200 | HTML 5 |
18918 | rti-rating.org | 22053 | 4.86 | 200 | HTML 5, English |
18919 | networkx.org | 22056 | 4.86 | 200 | HTML 5, No Lang |
18920 | scottdeluzio.com | 22057 | 4.86 | 200 | HTML 5, English |
18921 | netspotapp.com | 22058 | 4.86 | 200 | HTML 5, English |
18922 | gazette.gc.ca | 22059 | 4.86 | 200 | English |
18923 | fiware.org | 22060 | 4.86 | 200 | HTML 5, English |
18924 | compression.ru | 22061 | 4.86 | 200 | English, Strict |
18925 | bravenewcoin.com | 22062 | 4.86 | 200 | HTML 5, English |
18926 | teamlab.art | 22063 | 4.86 | 200 | HTML 5, English |
18927 | burningman.org | 22064 | 4.86 | 200 | HTML 5, English |
18928 | heartmath.com | 22065 | 4.86 | 200 | HTML 5, English |
18929 | birgun.net | 22066 | 4.86 | 200 | HTML 5 |
18930 | discoverwildlife.com | 22067 | 4.86 | 200 | HTML 5, English |
18931 | presbyterianmission.org | 22068 | 4.86 | 200 | HTML 5, English |
18932 | universosm.es | 22069 | 4.86 | 200 | HTML 5 |
18933 | lyricsmania.com | 22070 | 4.86 | 200 | HTML 5, English |
18934 | karelia.com | 22071 | 4.86 | 200 | HTML 5, English |
18935 | get.foundation | 22072 | 4.86 | 200 | HTML 5, English |
18936 | pixnio.com | 22073 | 4.86 | 200 | HTML 5, English |
18937 | winbeta.org | 22074 | 4.86 | 200 | HTML 5, English |
18938 | sport.optus.com.au | 22075 | 4.86 | 200 | HTML 5, English |
18939 | intersystems.com | 22076 | 4.86 | 200 | HTML 5, English |
18940 | eleccions.gencat.cat | 22077 | 4.86 | 200 | HTML 5 |
18941 | docs.datadoghq.com | 22078 | 4.86 | 200 | HTML 5, English |
18942 | freshwatersystems.com | 22081 | 4.86 | 200 | HTML 5, English |
18943 | acco.org | 22082 | 4.86 | 200 | HTML 5, English |
18944 | protegewiki.stanford.edu | 22084 | 4.86 | 200 | HTML 5, English |
18945 | inshorts.com | 22086 | 4.86 | 200 | HTML 5 |
18946 | dutchbros.com | 22087 | 4.86 | 200 | HTML 5, English |
18947 | math.niu.edu | 22088 | 4.86 | 200 | HTML 5, English |
18948 | ggplot2.tidyverse.org | 22089 | 4.86 | 200 | HTML 5, English |
18949 | j-cast.com | 22090 | 4.86 | 200 | HTML 5 |
18950 | mmamania.com | 22091 | 4.86 | 200 | HTML 5, English |
18951 | harrypotter.fandom.com | 22092 | 4.86 | 200 | HTML 5, English |
18952 | bus-und-bahn.de | 22093 | 4.86 | 200 | HTML 5 |
18953 | crd.org | 22095 | 4.86 | 200 | HTML 5, English |
18954 | vocalvideo.com | 22097 | 4.86 | 200 | HTML 5, English |
18955 | 1und1.de | 22098 | 4.86 | 200 | HTML 5 |
18956 | vins-rhone.com | 22100 | 4.86 | 200 | HTML 5 |
18957 | thoughtcrime.org | 22101 | 4.86 | 200 | HTML 5, English |
18958 | marketresearchfuture.com | 22102 | 4.86 | 200 | HTML 5, English |
18959 | officialcharts.com | 22103 | 4.86 | 200 | HTML 5, English |
18960 | kristeligt-dagblad.dk | 22104 | 4.86 | 200 | HTML 5 |
18961 | goo.su | 22106 | 4.86 | 200 | HTML 5, English |
18962 | bottrop.de | 22107 | 4.86 | 200 | HTML 5 |
18963 | dpdk.org | 22108 | 4.86 | 200 | HTML 5, English |
18964 | bakerhughes.com | 22109 | 4.86 | 200 | HTML 5, English |
18965 | heroicons.com | 22110 | 4.86 | 200 | HTML 5, No Lang |
18966 | movado.com | 22111 | 4.86 | 200 | HTML 5, English |
18967 | sethgodin.com | 22112 | 4.86 | 200 | English |
18968 | photomath.com | 22113 | 4.86 | 200 | HTML 5, English |
18969 | winfuture.de | 22114 | 4.86 | 200 | Transitional |
18970 | whyhunger.org | 22115 | 4.86 | 200 | HTML 5, No Lang |
18971 | demos.co.uk | 22116 | 4.86 | 200 | HTML 5, English |
18972 | espreso.tv | 22117 | 4.86 | 200 | HTML 5 |
18973 | dialup.com | 22118 | 4.86 | 200 | HTML 5, English |
18974 | deloitte.co.uk | 22119 | 4.86 | 200 | HTML 5, English |
18975 | milanoo.com | 22120 | 4.86 | 200 | HTML 5, English |
18976 | tokio.rs | 22121 | 4.86 | 200 | HTML 5, No Lang |
18977 | alzforum.org | 22122 | 4.86 | 200 | HTML 5, English |
18978 | travtasy.com | 22123 | 4.86 | 200 | HTML 5, English |
18979 | whois.arin.net | 22124 | 4.86 | 200 | No Lang, Transitional |
18980 | fhnw.ch | 22125 | 4.86 | 200 | HTML 5 |
18981 | marianne.net | 22126 | 4.86 | 200 | HTML 5 |
18982 | investor.google.com | 22127 | 4.86 | 200 | HTML 5, English |
18983 | docs.mollie.com | 22128 | 4.86 | 200 | HTML 5, English |
18984 | eadt.co.uk | 22130 | 4.86 | 200 | HTML 5, English |
18985 | ru.scribd.com | 22131 | 4.86 | 200 | HTML 5, English |
18986 | annefrank.org | 22134 | 4.86 | 200 | HTML 5, English |
18987 | travel.nytimes.com | 22135 | 4.86 | 200 | HTML 5, English |
18988 | holidaycheck.de | 22136 | 4.86 | 200 | HTML 5 |
18989 | duq.edu | 22137 | 4.86 | 200 | HTML 5, English |
18990 | seoul.co.kr | 22138 | 4.86 | 200 | HTML 5 |
18991 | slocounty.ca.gov | 22139 | 4.86 | 200 | HTML 5, English |
18992 | worldwildbrice.net | 22140 | 4.86 | 200 | HTML 5, English |
18993 | designer.microsoft.com | 22141 | 4.86 | 200 | HTML 5, English |
18994 | theengineer.co.uk | 22143 | 4.86 | 200 | HTML 5, English |
18995 | wise-qatar.org | 22144 | 4.86 | 200 | HTML 5, English |
18996 | medianama.com | 22145 | 4.86 | 200 | HTML 5, English |
18997 | nz.linkedin.com | 22146 | 4.86 | 200 | HTML 5, English |
18998 | connections-pro.com | 22147 | 4.86 | 200 | HTML 5, English |
18999 | calendar.online | 22148 | 4.86 | 200 | HTML 5, English |
19000 | software.opensuse.org | 22149 | 4.86 | 200 | HTML 5, No Lang |
Data from: Open PageRank