Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than by traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but ranked below 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home pages of the first 1M domains.
- Interested in stats on use of HTML 5, proper HTML, etc.
- Started with the first 100K sites and expanding to the first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains in the list are naked domains, not fully qualified hostnames.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, which should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without disabling SSL verification
- With standard security checks, clients will never get a redirect
- Domains that redirect behind an invalid SSL certificate
- This is very easy to fix, or to park the domain so it handles redirects.
- Lost revenue, given how much linking it takes to get on this list
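
The doctype and lang checks behind the observations above can be sketched roughly as follows. This is only an illustration, not the crawler actually used: the names `_HeadProbe` and `classify_flags` are hypothetical, and the rules for each flag (for example, treating any `lang` starting with `en` as "English") are assumptions inferred from the Flags column in the table below.

```python
from html.parser import HTMLParser


class _HeadProbe(HTMLParser):
    """Collects the first doctype declaration and the <html lang> attribute."""

    def __init__(self):
        super().__init__()
        self.doctype = None    # normalized, lowercased doctype text, if any
        self.lang = None       # lowercased lang value, or None when missing/empty
        self.seen_html = False

    def handle_decl(self, decl):
        # Called with the text inside <! ... >, e.g. "DOCTYPE html".
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.split()).lower()

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lowercased from HTMLParser.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            value = dict(attrs).get("lang") or ""
            self.lang = value.strip().lower() or None


def classify_flags(html_text):
    """Return flags such as ["HTML 5", "English"] for a home page (assumed rules)."""
    probe = _HeadProbe()
    probe.feed(html_text)
    flags = []
    # Assumption: only the short <!DOCTYPE html> form counts as HTML 5.
    if probe.doctype == "doctype html":
        flags.append("HTML 5")
    if probe.lang is None:
        flags.append("No Lang")
    elif probe.lang == "en" or probe.lang.startswith("en-"):
        flags.append("English")
    # Older doctypes are recognized by their public identifier text.
    if probe.doctype and "transitional" in probe.doctype:
        flags.append("Transitional")
    elif probe.doctype and "strict" in probe.doctype:
        flags.append("Strict")
    return flags
```

Under these assumed rules, a page with a non-English `lang` and an HTML 5 doctype gets only the "HTML 5" flag, which is consistent with how several non-English sites appear in the table.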
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
5201 | spektrum.de | 6136 | 5.28 | 200 | HTML 5 |
5202 | uh.edu | 6137 | 5.28 | 200 | HTML 5, English |
5203 | data.consilium.europa.eu | 6138 | 5.28 | 200 | No Lang |
5204 | accessibe.com | 6139 | 5.28 | 200 | HTML 5, English |
5205 | circleid.com | 6140 | 5.28 | 200 | English, Transitional |
5206 | people.ischool.berkeley.edu | 6141 | 5.28 | 200 | HTML 5, No Lang |
5207 | plymouth.ac.uk | 6144 | 5.28 | 200 | HTML 5, English |
5208 | kijiji.ca | 6145 | 5.28 | 200 | HTML 5, English |
5209 | cloud.mail.ru | 6146 | 5.28 | 200 | HTML 5, English |
5210 | monde-diplomatique.fr | 6147 | 5.28 | 200 | HTML 5 |
5211 | watoday.com.au | 6149 | 5.28 | 200 | HTML 5, English |
5212 | data.gov | 6150 | 5.28 | 200 | HTML 5, English |
5213 | dict.leo.org | 6151 | 5.28 | 200 | HTML 5, English |
5214 | expressen.se | 6152 | 5.28 | 200 | HTML 5 |
5215 | documentcloud.adobe.com | 6153 | 5.27 | 200 | HTML 5, No Lang |
5216 | pocket-lint.com | 6154 | 5.27 | 200 | HTML 5, English |
5217 | panic.com | 6155 | 5.27 | 200 | HTML 5, English |
5218 | tubefilter.com | 6156 | 5.27 | 200 | HTML 5, English |
5219 | allafrica.com | 6158 | 5.27 | 200 | HTML 5, English |
5220 | deepblue.lib.umich.edu | 6159 | 5.27 | 200 | HTML 5, English |
5221 | livingsocial.com | 6160 | 5.27 | 200 | HTML 5, English |
5222 | ebay.ca | 6161 | 5.27 | 200 | No Lang |
5223 | elnuevodia.com | 6162 | 5.27 | 200 | HTML 5 |
5224 | du.edu | 6163 | 5.27 | 200 | HTML 5, English |
5225 | ksat.com | 6165 | 5.27 | 200 | HTML 5, English |
5226 | aparat.com | 6166 | 5.27 | 200 | HTML 5 |
5227 | digistore24.com | 6167 | 5.27 | 200 | English, Strict |
5228 | new.siemens.com | 6168 | 5.27 | 200 | HTML 5, English |
5229 | greensock.com | 6169 | 5.27 | 200 | HTML 5, English |
5230 | gawker.com | 6170 | 5.27 | 200 | HTML 5, English |
5231 | nomorepass.com | 6172 | 5.27 | 200 | HTML 5 |
5232 | tattoodo.com | 6173 | 5.27 | 200 | HTML 5, English |
5233 | dexonline.ro | 6174 | 5.27 | 200 | HTML 5, No Lang |
5234 | lendingclub.com | 6175 | 5.27 | 200 | HTML 5, English |
5235 | sfist.com | 6176 | 5.27 | 200 | HTML 5, English |
5236 | personal.psu.edu | 6177 | 5.27 | 200 | No Lang |
5237 | aloyoga.com | 6178 | 5.27 | 200 | HTML 5, English |
5238 | jp.sharp | 6179 | 5.27 | 200 | HTML 5 |
5239 | paris.fr | 6180 | 5.27 | 200 | HTML 5 |
5240 | search.creativecommons.org | 6182 | 5.27 | 200 | HTML 5, English |
5241 | support.spotify.com | 6183 | 5.27 | 200 | HTML 5, English |
5242 | pokecommunity.com | 6184 | 5.27 | 200 | HTML 5, English |
5243 | twittercommunity.com | 6185 | 5.27 | 200 | HTML 5, English |
5244 | remax.com | 6186 | 5.27 | 200 | HTML 5, English |
5245 | central.wordcamp.org | 6187 | 5.27 | 200 | HTML 5, English |
5246 | hiphopdx.com | 6188 | 5.27 | 200 | HTML 5, English |
5247 | morgenpost.de | 6189 | 5.27 | 200 | HTML 5 |
5248 | brandeis.edu | 6190 | 5.27 | 200 | HTML 5, English |
5249 | login.salesforce.com | 6191 | 5.27 | 200 | English, Transitional |
5250 | gov.texas.gov | 6193 | 5.27 | 200 | HTML 5, English |
5251 | postmarkapp.com | 6194 | 5.27 | 200 | HTML 5, English |
5252 | 4pda.to | 6195 | 5.27 | 200 | HTML 5 |
5253 | fcbarcelona.com | 6196 | 5.27 | 200 | HTML 5, English |
5254 | pkg.go.dev | 6197 | 5.27 | 200 | HTML 5, English |
5255 | waveapps.com | 6199 | 5.27 | 200 | HTML 5, English |
5256 | brew.sh | 6200 | 5.27 | 200 | HTML 5, English |
5257 | console.bluemix.net | 6201 | 5.27 | 200 | HTML 5, English |
5258 | birds.cornell.edu | 6202 | 5.27 | 200 | HTML 5, English |
5259 | answerthepublic.com | 6204 | 5.27 | 200 | HTML 5, English |
5260 | ntnu.edu | 6205 | 5.27 | 200 | HTML 5, English |
5261 | japan-guide.com | 6206 | 5.27 | 200 | HTML 5, English |
5262 | ki.se | 6207 | 5.27 | 200 | HTML 5 |
5263 | socialpilot.co | 6209 | 5.27 | 200 | HTML 5, English |
5264 | insidethemagic.net | 6210 | 5.27 | 200 | HTML 5, English |
5265 | mkweb.bcgsc.ca | 6212 | 5.27 | 200 | HTML 5, English |
5266 | postcron.com | 6213 | 5.27 | 200 | HTML 5, English |
5267 | shudder.com | 6214 | 5.27 | 200 | HTML 5, English |
5268 | musicbusinessworldwide.com | 6215 | 5.27 | 200 | HTML 5, English |
5269 | road.cc | 6216 | 5.27 | 200 | English |
5270 | thoughtworks.com | 6218 | 5.27 | 200 | HTML 5, English |
5271 | whoi.edu | 6219 | 5.27 | 200 | HTML 5, English |
5272 | google.hr | 6220 | 5.27 | 200 | HTML 5, English |
5273 | planet.com | 6221 | 5.27 | 200 | HTML 5, English |
5274 | systeme.io | 6222 | 5.27 | 200 | HTML 5, English |
5275 | chartjs.org | 6223 | 5.27 | 200 | HTML 5, English |
5276 | macg.co | 6225 | 5.27 | 200 | HTML 5 |
5277 | telecompaper.com | 6226 | 5.27 | 200 | HTML 5, English |
5278 | dwd.de | 6227 | 5.27 | 200 | HTML 5 |
5279 | knightfoundation.org | 6228 | 5.27 | 200 | HTML 5, English |
5280 | wien.gv.at | 6229 | 5.27 | 200 | HTML 5 |
5281 | codesandbox.io | 6230 | 5.27 | 200 | HTML 5, English |
5282 | indiebound.org | 6231 | 5.27 | 200 | HTML 5, English |
5283 | pip.pypa.io | 6232 | 5.27 | 200 | HTML 5, English |
5284 | nbcwashington.com | 6233 | 5.27 | 200 | HTML 5, English |
5285 | augsburger-allgemeine.de | 6234 | 5.27 | 200 | HTML 5 |
5286 | undocs.org | 6235 | 5.27 | 200 | HTML 5, English |
5287 | play.acast.com | 6236 | 5.27 | 200 | HTML 5, English |
5288 | partner.microsoft.com | 6237 | 5.27 | 200 | HTML 5, English |
5289 | pinkvilla.com | 6238 | 5.27 | 200 | HTML 5, English |
5290 | onet.pl | 6239 | 5.27 | 200 | HTML 5 |
5291 | sketchapp.com | 6240 | 5.27 | 200 | HTML 5, English |
5292 | business.financialpost.com | 6241 | 5.27 | 200 | HTML 5, No Lang |
5293 | cminds.com | 6242 | 5.27 | 200 | HTML 5, English |
5294 | iris.edu | 6243 | 5.27 | 200 | HTML 5, No Lang |
5295 | ktvu.com | 6244 | 5.27 | 200 | HTML 5, English |
5296 | ingentaconnect.com | 6245 | 5.27 | 200 | HTML 5, English |
5297 | digikey.com | 6246 | 5.27 | 200 | HTML 5, English |
5298 | utorrent.com | 6247 | 5.27 | 200 | HTML 5, English |
5299 | hubzilla.org | 6248 | 5.27 | 200 | HTML 5, No Lang |
5300 | jcp.org | 6249 | 5.27 | 200 | No Lang, Transitional |
Data from: Open PageRank