Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
4401 | itwire.com | 5217 | 5.34 | 200 | HTML 5, English |
4402 | colt.net | 5218 | 5.34 | 200 | HTML 5, English |
4403 | honeywell.com | 5219 | 5.34 | 200 | HTML 5, English |
4404 | expressandstar.com | 5220 | 5.34 | 200 | HTML 5, English |
4405 | about.ads.microsoft.com | 5221 | 5.34 | 200 | HTML 5, English |
4406 | le.ac.uk | 5222 | 5.34 | 200 | HTML 5, English |
4407 | microbit.org | 5224 | 5.34 | 200 | HTML 5, English |
4408 | ccpgames.com | 5225 | 5.34 | 200 | HTML 5, English |
4409 | knowledge.wharton.upenn.edu | 5226 | 5.34 | 200 | HTML 5, English |
4410 | glaad.org | 5227 | 5.34 | 200 | HTML 5, English |
4411 | essex.ac.uk | 5228 | 5.34 | 200 | HTML 5, English |
4412 | golang.org | 5229 | 5.34 | 200 | HTML 5, English |
4413 | hbomax.com | 5230 | 5.34 | 200 | HTML 5, English |
4414 | cnrs.fr | 5232 | 5.34 | 200 | HTML 5 |
4415 | finance.sina.com.cn | 5233 | 5.34 | 200 | HTML 5, No Lang |
4416 | shipstation.com | 5234 | 5.34 | 200 | HTML 5, English |
4417 | kdp.amazon.com | 5235 | 5.34 | 200 | HTML 5, English |
4418 | chromereleases.googleblog.com | 5236 | 5.34 | 200 | HTML 5, English |
4419 | ppg.com | 5237 | 5.34 | 200 | HTML 5, English |
4420 | informit.com | 5238 | 5.34 | 200 | HTML 5, No Lang |
4421 | faire.com | 5239 | 5.34 | 200 | HTML 5, English |
4422 | atsdr.cdc.gov | 5240 | 5.34 | 200 | HTML 5, English |
4423 | thersa.org | 5241 | 5.34 | 200 | HTML 5, English |
4424 | teamup.com | 5242 | 5.34 | 200 | HTML 5, English |
4425 | hal.science | 5243 | 5.34 | 200 | HTML 5, English |
4426 | tedbaker.com | 5244 | 5.34 | 200 | HTML 5, English |
4427 | support.skype.com | 5247 | 5.34 | 200 | HTML 5, English |
4428 | abc13.com | 5248 | 5.34 | 200 | HTML 5, English |
4429 | nationalinterest.org | 5249 | 5.34 | 200 | HTML 5, English |
4430 | artmajeur.com | 5250 | 5.34 | 200 | HTML 5, English |
4431 | trthaber.com | 5251 | 5.34 | 200 | HTML 5 |
4432 | uccs.edu | 5252 | 5.34 | 200 | HTML 5, English |
4433 | vidyard.com | 5253 | 5.34 | 200 | HTML 5, English |
4434 | taste.com.au | 5254 | 5.34 | 200 | HTML 5, English |
4435 | eventbrite.it | 5255 | 5.34 | 200 | HTML 5, No Lang |
4436 | amplify.com | 5256 | 5.34 | 200 | HTML 5, English |
4437 | whois.com | 5257 | 5.34 | 200 | HTML 5, English |
4438 | johnmacfarlane.net | 5258 | 5.34 | 200 | HTML 5, English |
4439 | br.linkedin.com | 5259 | 5.34 | 200 | HTML 5 |
4440 | ccohs.ca | 5260 | 5.34 | 200 | HTML 5, English |
4441 | scouting.org | 5261 | 5.34 | 200 | HTML 5, English |
4442 | healthgrades.com | 5262 | 5.34 | 200 | HTML 5, English |
4443 | hover.com | 5263 | 5.34 | 200 | HTML 5, No Lang |
4444 | nearpod.com | 5264 | 5.34 | 200 | HTML 5, English |
4445 | journal.frontiersin.org | 5265 | 5.34 | 200 | HTML 5, English |
4446 | binged.it | 5266 | 5.34 | 200 | HTML 5, English |
4447 | usa.kaspersky.com | 5267 | 5.34 | 200 | HTML 5, English |
4448 | nsw.gov.au | 5268 | 5.34 | 200 | HTML 5, English |
4449 | wondery.com | 5269 | 5.34 | 200 | HTML 5, English |
4450 | vlaanderen.be | 5270 | 5.34 | 200 | HTML 5 |
4451 | architecture.com | 5271 | 5.34 | 200 | HTML 5, English |
4452 | nh.gov | 5272 | 5.34 | 200 | HTML 5, English |
4453 | fliphtml5.com | 5273 | 5.34 | 200 | HTML 5, No Lang |
4454 | wevideo.com | 5275 | 5.34 | 200 | HTML 5, English |
4455 | estadao.com.br | 5276 | 5.34 | 200 | HTML 5 |
4456 | ars.usda.gov | 5277 | 5.34 | 200 | HTML 5, No Lang |
4457 | databricks.com | 5278 | 5.34 | 200 | HTML 5, English |
4458 | pcloud.com | 5279 | 5.34 | 200 | HTML 5, English |
4459 | bundesbank.de | 5280 | 5.34 | 200 | HTML 5, English |
4460 | amazon.jobs | 5281 | 5.34 | 200 | HTML 5, English |
4461 | rogerebert.com | 5282 | 5.34 | 200 | HTML 5, English |
4462 | sussex.ac.uk | 5283 | 5.34 | 200 | HTML 5, English |
4463 | which.co.uk | 5284 | 5.34 | 200 | HTML 5, English |
4464 | comicskingdom.com | 5285 | 5.34 | 200 | HTML 5, English |
4465 | polar.com | 5287 | 5.34 | 200 | HTML 5, English |
4466 | clio.com | 5288 | 5.34 | 200 | HTML 5, English |
4467 | site.com | 5289 | 5.34 | 200 | HTML 5, English |
4468 | hottopic.com | 5290 | 5.34 | 200 | HTML 5, No Lang |
4469 | lightinthebox.com | 5291 | 5.34 | 200 | HTML 5, English |
4470 | health.usnews.com | 5292 | 5.34 | 200 | HTML 5, English |
4471 | wi-fi.org | 5293 | 5.34 | 200 | HTML 5, English |
4472 | section508.gov | 5294 | 5.34 | 200 | HTML 5, English |
4473 | brit.co | 5295 | 5.34 | 200 | HTML 5, English |
4474 | 1stdibs.com | 5296 | 5.34 | 200 | HTML 5, English |
4475 | marxists.org | 5297 | 5.34 | 200 | HTML 5, English |
4476 | match.com | 5298 | 5.34 | 200 | HTML 5, No Lang |
4477 | justgetflux.com | 5299 | 5.34 | 200 | HTML 5, English |
4478 | fifa.com | 5300 | 5.34 | 200 | HTML 5, English |
4479 | ancient-origins.net | 5301 | 5.34 | 200 | HTML 5, English |
4480 | potterybarn.com | 5302 | 5.34 | 200 | HTML 5, English |
4481 | globalsign.com | 5303 | 5.34 | 200 | HTML 5, English |
4482 | usgbc.org | 5304 | 5.34 | 200 | HTML 5, English |
4483 | boohoo.com | 5305 | 5.34 | 200 | HTML 5, English |
4484 | ecologie.gouv.fr | 5306 | 5.34 | 200 | HTML 5 |
4485 | fr.wordpress.org | 5307 | 5.34 | 200 | HTML 5 |
4486 | listennotes.com | 5309 | 5.34 | 200 | HTML 5, English |
4487 | posit.co | 5310 | 5.33 | 200 | HTML 5, English |
4488 | mixi.jp | 5312 | 5.33 | 200 | HTML 5 |
4489 | nrc.nl | 5313 | 5.33 | 200 | HTML 5 |
4490 | news.artnet.com | 5314 | 5.33 | 200 | HTML 5, English |
4491 | mars.nasa.gov | 5315 | 5.33 | 200 | HTML 5, English |
4492 | buenosaires.gob.ar | 5317 | 5.33 | 200 | HTML 5 |
4493 | swpc.noaa.gov | 5318 | 5.33 | 200 | HTML 5, English |
4494 | usc.edu | 5319 | 5.33 | 200 | HTML 5, English |
4495 | state.nj.us | 5320 | 5.33 | 200 | HTML 5, English |
4496 | webmaster.yandex.ru | 5321 | 5.33 | 200 | HTML 5 |
4497 | web.telegram.org | 5322 | 5.33 | 200 | HTML 5, English |
4498 | sucuri.net | 5323 | 5.33 | 200 | HTML 5, English |
4499 | demorgen.be | 5325 | 5.33 | 200 | HTML 5 |
4500 | middleeasteye.net | 5326 | 5.33 | 200 | HTML 5, English |
Data from: Open PageRank