Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is prioritized by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but ranked below 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, which should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks, clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix, or park, to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
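The doctype and lang observations above could be checked with something like the following. This is only a minimal sketch, not the crawler actually used here: the `classify` function, the regular expressions, and the exact rules behind each flag are assumptions, with flag names borrowed from the Flags column of the table.

```python
import re

# Assumed rules; the real crawler's logic is not shown in these notes.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# Match a bare lang attribute, but not xml:lang or hreflang.
LANG_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return flags describing the doctype and lang attribute of a page."""
    head = html[:4096]  # both declarations should appear near the top
    flags = []
    legacy = None

    doctype = DOCTYPE_RE.search(head)
    if doctype:
        decl = " ".join(doctype.group(1).split()).lower()
        if decl == "html":
            flags.append("HTML 5")       # plain <!DOCTYPE html>
        elif "transitional" in decl:
            legacy = "Transitional"      # HTML 4.01 / XHTML 1.0 Transitional
        elif "strict" in decl:
            legacy = "Strict"            # HTML 4.01 / XHTML 1.0 Strict

    tag = HTML_TAG_RE.search(head)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().split("-")[0] == "en":
        flags.append("English")          # en, en-US, en-GB, ...

    if legacy:
        flags.append(legacy)
    return flags


if __name__ == "__main__":
    print(classify('<!DOCTYPE html><html lang="en-US"><head></head></html>'))
    print(classify("<html><body>"))
```

Under these assumed rules, a page with a non-English lang value and an HTML 5 doctype gets only the `HTML 5` flag, which is one plausible reading of rows such as naver.com.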
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
4501 | lexico.com | 5327 | 5.33 | 200 | HTML 5, English |
4502 | naver.com | 5328 | 5.33 | 200 | HTML 5 |
4503 | zend.com | 5329 | 5.33 | 200 | HTML 5, English |
4504 | tpwd.texas.gov | 5330 | 5.33 | 200 | HTML 5, English |
4505 | wpastra.com | 5332 | 5.33 | 200 | HTML 5, English |
4506 | r-project.org | 5334 | 5.33 | 200 | HTML 5, English |
4507 | thenationalnews.com | 5335 | 5.33 | 200 | HTML 5, English |
4508 | embed.ly | 5336 | 5.33 | 200 | HTML 5, English |
4509 | fdc.nal.usda.gov | 5337 | 5.33 | 200 | HTML 5, English |
4510 | budgetbytes.com | 5338 | 5.33 | 200 | HTML 5, English |
4511 | profile.ameba.jp | 5339 | 5.33 | 200 | HTML 5 |
4512 | whova.com | 5340 | 5.33 | 200 | HTML 5, English |
4513 | aclweb.org | 5341 | 5.33 | 200 | English |
4514 | element.io | 5342 | 5.33 | 200 | No Lang |
4515 | happycow.net | 5343 | 5.33 | 200 | HTML 5, English |
4516 | holytrinityorthodox.com | 5344 | 5.33 | 200 | No Lang, Transitional |
4517 | baynews9.com | 5345 | 5.33 | 200 | HTML 5, English |
4518 | weblineindia.com | 5346 | 5.33 | 200 | HTML 5, English |
4519 | cash.me | 5347 | 5.33 | 200 | HTML 5, English |
4520 | flowcode.com | 5348 | 5.33 | 200 | HTML 5, English |
4521 | post.japanpost.jp | 5349 | 5.33 | 200 | HTML 5 |
4522 | pandas.pydata.org | 5350 | 5.33 | 200 | HTML 5, No Lang |
4523 | pix11.com | 5351 | 5.33 | 200 | HTML 5, English |
4524 | cs.cornell.edu | 5352 | 5.33 | 200 | HTML 5, English |
4525 | active.com | 5354 | 5.33 | 200 | HTML 5, English |
4526 | garanteprivacy.it | 5355 | 5.33 | 200 | HTML 5 |
4527 | careers.google.com | 5356 | 5.33 | 200 | HTML 5, English |
4528 | gpg4win.org | 5357 | 5.33 | 200 | No Lang, Strict |
4529 | time.is | 5358 | 5.33 | 200 | HTML 5, English |
4530 | rti.org | 5359 | 5.33 | 200 | HTML 5, English |
4531 | sonarsource.com | 5360 | 5.33 | 200 | HTML 5, English |
4532 | threema.ch | 5361 | 5.33 | 200 | HTML 5, English |
4533 | convinceandconvert.com | 5362 | 5.33 | 200 | HTML 5, English |
4534 | datacenterknowledge.com | 5363 | 5.33 | 200 | HTML 5, English |
4535 | nespresso.com | 5365 | 5.33 | 200 | HTML 5, English |
4536 | en.wikiquote.org | 5366 | 5.33 | 200 | HTML 5, No Lang |
4537 | radar.oreilly.com | 5367 | 5.33 | 200 | HTML 5, English |
4538 | merchants.google.com | 5369 | 5.33 | 200 | HTML 5, English |
4539 | psmag.com | 5371 | 5.33 | 200 | HTML 5, English |
4540 | badoo.com | 5372 | 5.33 | 200 | HTML 5, English |
4541 | evo.com | 5374 | 5.33 | 200 | HTML 5, English |
4542 | shows.acast.com | 5375 | 5.33 | 200 | HTML 5, English |
4543 | rochester.edu | 5376 | 5.33 | 200 | HTML 5, English |
4544 | datawrapper.dwcdn.net | 5377 | 5.33 | 200 | No Lang |
4545 | pacsun.com | 5378 | 5.33 | 200 | HTML 5, English |
4546 | google.github.io | 5379 | 5.33 | 200 | HTML 5, No Lang |
4547 | lifehacker.com.au | 5380 | 5.33 | 200 | HTML 5, English |
4548 | kennedyspacecenter.com | 5381 | 5.33 | 200 | HTML 5, English |
4549 | draft.blogger.com | 5384 | 5.33 | 200 | HTML 5, English |
4550 | momoyoga.com | 5385 | 5.33 | 200 | HTML 5, English |
4551 | google.com.vn | 5386 | 5.33 | 200 | HTML 5, English |
4552 | aeon.co | 5388 | 5.33 | 200 | HTML 5, English |
4553 | globaltimes.cn | 5389 | 5.33 | 200 | HTML 5, English |
4554 | poedit.net | 5392 | 5.33 | 200 | HTML 5, English |
4555 | cloudup.com | 5393 | 5.33 | 200 | HTML 5, No Lang |
4556 | breuninger.com | 5394 | 5.33 | 200 | HTML 5, English |
4557 | losangeles.cbslocal.com | 5395 | 5.33 | 200 | HTML 5, English |
4558 | gov.mb.ca | 5397 | 5.33 | 200 | HTML 5, English |
4559 | clockify.me | 5398 | 5.33 | 200 | HTML 5, No Lang |
4560 | pinterest.ch | 5400 | 5.33 | 200 | HTML 5, English |
4561 | appadvice.com | 5401 | 5.33 | 200 | HTML 5, No Lang |
4562 | tradingeconomics.com | 5402 | 5.33 | 200 | HTML 5, No Lang |
4563 | sebrae.com.br | 5403 | 5.33 | 200 | HTML 5 |
4564 | idokep.hu | 5404 | 5.33 | 200 | HTML 5 |
4565 | ru.linkedin.com | 5405 | 5.33 | 200 | HTML 5 |
4566 | icons.getbootstrap.com | 5406 | 5.33 | 200 | HTML 5, English |
4567 | haaretz.co.il | 5408 | 5.33 | 200 | HTML 5 |
4568 | google-latlong.blogspot.com | 5409 | 5.33 | 200 | HTML 5, English |
4569 | grafana.com | 5410 | 5.33 | 200 | HTML 5, English |
4570 | 21.edu.ar | 5411 | 5.33 | 200 | HTML 5 |
4571 | yankodesign.com | 5412 | 5.33 | 200 | HTML 5, English |
4572 | it.pinterest.com | 5413 | 5.33 | 200 | HTML 5, English |
4573 | uptodate.com | 5414 | 5.33 | 200 | HTML 5, No Lang |
4574 | cloud.tencent.com | 5415 | 5.33 | 200 | HTML 5 |
4575 | onezero.medium.com | 5416 | 5.33 | 200 | HTML 5, English |
4576 | encyclopedia.com | 5417 | 5.33 | 200 | HTML 5, English |
4577 | bitcatcha.com | 5418 | 5.33 | 200 | HTML 5, English |
4578 | chicago.cbslocal.com | 5419 | 5.33 | 200 | HTML 5, English |
4579 | raphkoster.com | 5421 | 5.33 | 200 | HTML 5, English |
4580 | huffingtonpost.ca | 5422 | 5.33 | 200 | HTML 5, English |
4581 | ally.com | 5423 | 5.33 | 200 | HTML 5, English |
4582 | sympla.com.br | 5424 | 5.33 | 200 | HTML 5 |
4583 | sway.office.com | 5425 | 5.33 | 200 | English |
4584 | brightside.me | 5426 | 5.33 | 200 | HTML 5, English |
4585 | fire.ca.gov | 5427 | 5.33 | 200 | HTML 5, English |
4586 | scotthelme.co.uk | 5428 | 5.33 | 200 | HTML 5, English |
4587 | themarginalian.org | 5429 | 5.33 | 200 | HTML 5, English |
4588 | usability.gov | 5430 | 5.33 | 200 | No Lang |
4589 | generatepress.com | 5431 | 5.33 | 200 | HTML 5, English |
4590 | openclassrooms.com | 5432 | 5.33 | 200 | HTML 5, English |
4591 | blogs.msdn.microsoft.com | 5433 | 5.33 | 200 | HTML 5, English |
4592 | chicagobooth.edu | 5434 | 5.33 | 200 | HTML 5, English |
4593 | apoia.se | 5435 | 5.33 | 200 | HTML 5, No Lang |
4594 | ushmm.org | 5436 | 5.33 | 200 | HTML 5, English |
4595 | synology.com | 5437 | 5.33 | 200 | HTML 5, English |
4596 | ics.uci.edu | 5438 | 5.33 | 200 | HTML 5, English |
4597 | mamamia.com.au | 5439 | 5.33 | 200 | HTML 5, English |
4598 | lexpress.fr | 5440 | 5.33 | 200 | HTML 5 |
4599 | chemistryworld.com | 5441 | 5.32 | 200 | HTML 5, English |
4600 | newatlas.com | 5442 | 5.32 | 200 | HTML 5, English |
Data from: Open PageRank