Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
17101 | waterdata.usgs.gov | 19931 | 4.88 | 200 | HTML 5, English |
17102 | docs.bazel.build | 19932 | 4.88 | 200 | No Lang |
17103 | hungerstation.com | 19934 | 4.88 | 200 | HTML 5 |
17104 | bundeswehr.de | 19935 | 4.88 | 200 | HTML 5 |
17105 | pedidosya.com | 19936 | 4.88 | 200 | HTML 5 |
17106 | share.transistor.fm | 19938 | 4.88 | 200 | No Lang |
17107 | bandwidth.com | 19939 | 4.88 | 200 | HTML 5, English |
17108 | handelszeitung.ch | 19940 | 4.88 | 200 | HTML 5 |
17109 | isrctn.com | 19941 | 4.88 | 200 | HTML 5, English |
17110 | themes.trac.wordpress.org | 19942 | 4.88 | 200 | No Lang, Strict |
17111 | realtimerendering.com | 19944 | 4.88 | 200 | No Lang, Transitional |
17112 | danga.com | 19945 | 4.88 | 200 | No Lang |
17113 | norml.org | 19947 | 4.88 | 200 | HTML 5, English |
17114 | turfjs.org | 19948 | 4.88 | 200 | HTML 5, English |
17115 | visitsaudi.com | 19952 | 4.88 | 200 | HTML 5, English |
17116 | brandfolder.com | 19953 | 4.88 | 200 | HTML 5, English |
17117 | dft.gov.uk | 19955 | 4.88 | 200 | HTML 5, English |
17118 | twentytwowords.com | 19956 | 4.88 | 200 | HTML 5, English |
17119 | littlealchemy2.com | 19957 | 4.88 | 200 | HTML 5, English |
17120 | mitsubishicars.com | 19958 | 4.88 | 200 | HTML 5, English |
17121 | earth.com | 19959 | 4.88 | 200 | HTML 5, English |
17122 | holland.com | 19960 | 4.88 | 200 | No Lang |
17123 | apuntmedia.es | 19961 | 4.88 | 200 | HTML 5 |
17124 | mirillis.com | 19963 | 4.88 | 200 | HTML 5, English |
17125 | bigquery.cloud.google.com | 19964 | 4.88 | 200 | HTML 5, English |
17126 | kusi.com | 19965 | 4.88 | 200 | HTML 5, English |
17127 | janes.com | 19966 | 4.88 | 200 | HTML 5, English |
17128 | jyu.fi | 19967 | 4.88 | 200 | HTML 5 |
17129 | entwickler.de | 19968 | 4.88 | 200 | HTML 5 |
17130 | jenis.com | 19969 | 4.88 | 200 | HTML 5, English |
17131 | 911memorial.org | 19970 | 4.88 | 200 | HTML 5, English |
17132 | commoncause.org | 19972 | 4.88 | 200 | HTML 5, English |
17133 | signingsavvy.com | 19973 | 4.88 | 200 | HTML 5, English |
17134 | easel.ly | 19975 | 4.88 | 200 | HTML 5, English |
17135 | hexagon.com | 19976 | 4.88 | 200 | HTML 5, English |
17136 | nativecos.com | 19977 | 4.88 | 200 | HTML 5, No Lang |
17137 | starwars.wikia.com | 19978 | 4.88 | 200 | HTML 5, English |
17138 | ischool.umd.edu | 19979 | 4.88 | 200 | HTML 5, English |
17139 | vr.fi | 19980 | 4.88 | 200 | HTML 5 |
17140 | cidoc-crm.org | 19982 | 4.88 | 200 | HTML 5, English |
17141 | lumc.nl | 19983 | 4.88 | 200 | HTML 5 |
17142 | sib.swiss | 19984 | 4.88 | 200 | HTML 5, English |
17143 | alcom.ax | 19985 | 4.88 | 200 | HTML 5 |
17144 | group.kadokawa.co.jp | 19986 | 4.88 | 200 | HTML 5 |
17145 | nhk.jp | 19987 | 4.88 | 200 | HTML 5 |
17146 | ageconsearch.umn.edu | 19988 | 4.88 | 200 | HTML 5, English |
17147 | openphilanthropy.org | 19992 | 4.88 | 200 | HTML 5, English |
17148 | forum.teamspeak.com | 19993 | 4.88 | 200 | HTML 5, English |
17149 | aidsmap.com | 19994 | 4.88 | 200 | HTML 5, English |
17150 | traveloka.com | 19995 | 4.88 | 200 | HTML 5, English |
17151 | culturecommunication.gouv.fr | 19996 | 4.88 | 200 | HTML 5 |
17152 | instawp.com | 19997 | 4.88 | 200 | HTML 5, English |
17153 | nationalobserver.com | 19998 | 4.88 | 200 | HTML 5, English |
17154 | blog.litespeedtech.com | 20001 | 4.88 | 200 | HTML 5, English |
17155 | hagerty.com | 20002 | 4.88 | 200 | HTML 5, English |
17156 | gossamer-threads.com | 20003 | 4.88 | 200 | HTML 5, English |
17157 | hive.blog | 20004 | 4.88 | 200 | HTML 5, English |
17158 | autozone.com | 20005 | 4.88 | 200 | HTML 5, English |
17159 | finance.si | 20006 | 4.88 | 200 | HTML 5 |
17160 | hbz-nrw.de | 20007 | 4.88 | 200 | HTML 5, English |
17161 | oberlin.edu | 20008 | 4.88 | 200 | HTML 5, English |
17162 | infohub.nyced.org | 20009 | 4.88 | 200 | HTML 5, English |
17163 | seeker.com | 20011 | 4.88 | 200 | HTML 5, English |
17164 | news.ontario.ca | 20013 | 4.88 | 200 | HTML 5, English |
17165 | globalratings.com | 20014 | 4.88 | 200 | No Lang, Transitional |
17166 | guokr.com | 20015 | 4.88 | 200 | HTML 5, English |
17167 | testanything.org | 20016 | 4.88 | 200 | HTML 5, No Lang |
17168 | columbiaspectator.com | 20017 | 4.88 | 200 | HTML 5, No Lang |
17169 | diglib.eg.org | 20018 | 4.88 | 200 | HTML 5, English |
17170 | pkp.sfu.ca | 20019 | 4.88 | 200 | HTML 5, English |
17171 | de.euronews.com | 20020 | 4.88 | 200 | HTML 5 |
17172 | publications.lib.chalmers.se | 20021 | 4.88 | 200 | |
17173 | nioz.nl | 20022 | 4.88 | 200 | HTML 5, English |
17174 | blog.trailofbits.com | 20023 | 4.88 | 200 | HTML 5, English |
17175 | yalelawjournal.org | 20024 | 4.88 | 200 | HTML 5, No Lang |
17176 | nt.gov.au | 20025 | 4.88 | 200 | HTML 5, English |
17177 | 72.ru | 20026 | 4.88 | 200 | HTML 5 |
17178 | legistar.council.nyc.gov | 20027 | 4.88 | 200 | English, Transitional |
17179 | coinex.com | 20028 | 4.88 | 200 | HTML 5, English |
17180 | termux.com | 20030 | 4.88 | 200 | HTML 5, English |
17181 | carbonhealth.com | 20031 | 4.88 | 200 | HTML 5, English |
17182 | christophm.github.io | 20033 | 4.88 | 200 | HTML 5, English |
17183 | mnufc.com | 20034 | 4.88 | 200 | HTML 5, English |
17184 | mtxc.eu | 20035 | 4.88 | 200 | HTML 5, No Lang |
17185 | trendsmap.com | 20036 | 4.88 | 200 | HTML 5, No Lang |
17186 | lcamtuf.coredump.cx | 20037 | 4.88 | 200 | No Lang |
17187 | store.posimyth.com | 20038 | 4.88 | 200 | HTML 5, English |
17188 | opengameart.org | 20039 | 4.88 | 200 | English |
17189 | yenicaggazetesi.com.tr | 20040 | 4.88 | 200 | HTML 5 |
17190 | sweden.se | 20042 | 4.88 | 200 | HTML 5, English |
17191 | search.maven.org | 20044 | 4.88 | 200 | No Lang |
17192 | iseecars.com | 20045 | 4.88 | 200 | HTML 5, English |
17193 | humanwhocodes.com | 20046 | 4.88 | 200 | HTML 5, English |
17194 | breez.technology | 20047 | 4.88 | 200 | HTML 5, English |
17195 | math.uic.edu | 20049 | 4.88 | 200 | HTML 5, English |
17196 | aan.com | 20050 | 4.88 | 200 | HTML 5, English |
17197 | chef.io | 20051 | 4.88 | 200 | HTML 5, English |
17198 | q-dance.com | 20052 | 4.88 | 200 | HTML 5, No Lang |
17199 | clarionledger.com | 20053 | 4.88 | 200 | HTML 5, English |
17200 | cvdazzle.com | 20054 | 4.88 | 200 | HTML 5, English |
Data from: Open PageRank