Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
10601 | messages.google.com | 12357 | 5.03 | 200 | HTML 5, English |
10602 | media.un.org | 12359 | 5.03 | 200 | HTML 5, English |
10603 | amacad.org | 12360 | 5.03 | 200 | HTML 5, English |
10604 | melskitchencafe.com | 12361 | 5.03 | 200 | HTML 5, English |
10605 | oik-plugins.com | 12362 | 5.03 | 200 | HTML 5, English |
10606 | fundly.com | 12363 | 5.03 | 200 | HTML 5, No Lang |
10607 | soundtrap.com | 12364 | 5.03 | 200 | HTML 5, English |
10608 | nickjr.com | 12365 | 5.03 | 200 | HTML 5, English |
10609 | goguardian.com | 12366 | 5.03 | 200 | HTML 5, English |
10610 | gov.nl.ca | 12367 | 5.03 | 200 | HTML 5, English |
10611 | ntnu.no | 12368 | 5.03 | 200 | HTML 5 |
10612 | thebeatles.com | 12369 | 5.03 | 200 | HTML 5, English |
10613 | corel.com | 12370 | 5.03 | 200 | HTML 5, English |
10614 | solidproject.org | 12371 | 5.03 | 200 | HTML 5, English |
10615 | openlab.ncl.ac.uk | 12372 | 5.03 | 200 | HTML 5, English |
10616 | clir.org | 12373 | 5.03 | 200 | HTML 5, English |
10617 | takaratomy.co.jp | 12374 | 5.03 | 200 | HTML 5 |
10618 | esmadrid.com | 12375 | 5.03 | 200 | |
10619 | toucharcade.com | 12376 | 5.03 | 200 | HTML 5, English |
10620 | documentation.onesignal.com | 12377 | 5.03 | 200 | HTML 5, English |
10621 | podtail.com | 12378 | 5.03 | 200 | HTML 5, English |
10622 | home.saxo | 12379 | 5.03 | 200 | HTML 5, English |
10623 | cuebiq.com | 12380 | 5.03 | 200 | HTML 5, English |
10624 | login.xyz | 12381 | 5.03 | 200 | HTML 5, English |
10625 | styleseat.com | 12382 | 5.03 | 200 | HTML 5, English |
10626 | uk-air.defra.gov.uk | 12383 | 5.03 | 200 | HTML 5, English |
10627 | infomoney.com.br | 12385 | 5.03 | 200 | HTML 5 |
10628 | massey.ac.nz | 12386 | 5.03 | 200 | HTML 5, English |
10629 | world.kbs.co.kr | 12387 | 5.03 | 200 | English, Transitional |
10630 | linklist.bio | 12388 | 5.03 | 200 | HTML 5, English |
10631 | applovin.com | 12389 | 5.03 | 200 | HTML 5, English |
10632 | progressive.com | 12391 | 5.03 | 200 | HTML 5, English |
10633 | apps.kde.org | 12392 | 5.03 | 200 | HTML 5, English |
10634 | blog.gitnux.com | 12395 | 5.03 | 200 | HTML 5, English |
10635 | wizards.com | 12396 | 5.03 | 200 | HTML 5, English |
10636 | adnkronos.com | 12397 | 5.03 | 200 | No Lang |
10637 | ksta.de | 12398 | 5.03 | 200 | HTML 5 |
10638 | hachyderm.io | 12399 | 5.03 | 200 | HTML 5, English |
10639 | masp.org.br | 12400 | 5.03 | 200 | HTML 5, English |
10640 | api.video | 12401 | 5.03 | 200 | HTML 5, English |
10641 | tech.dropbox.com | 12402 | 5.03 | 200 | HTML 5, English |
10642 | tripoto.com | 12403 | 5.03 | 200 | HTML 5, English |
10643 | sowetanlive.co.za | 12404 | 5.03 | 200 | HTML 5, English |
10644 | biltmore.com | 12405 | 5.03 | 200 | HTML 5, English |
10645 | confoo.ca | 12407 | 5.03 | 200 | HTML 5, English |
10646 | themeparkinsider.com | 12409 | 5.03 | 200 | HTML 5, English |
10647 | chemspider.com | 12411 | 5.03 | 200 | HTML 5, English |
10648 | chsinc.com | 12413 | 5.03 | 200 | HTML 5, No Lang |
10649 | blog.cryptographyengineering.com | 12414 | 5.03 | 200 | HTML 5, English |
10650 | mn.uio.no | 12417 | 5.03 | 200 | HTML 5 |
10651 | canalplus.com | 12418 | 5.03 | 200 | HTML 5 |
10652 | marvel.fandom.com | 12419 | 5.03 | 200 | HTML 5, English |
10653 | scholarworks.iu.edu | 12420 | 5.03 | 200 | HTML 5, English |
10654 | forbes.ru | 12421 | 5.03 | 200 | HTML 5 |
10655 | kliken.com | 12422 | 5.03 | 200 | HTML 5, English |
10656 | epsg.io | 12423 | 5.03 | 200 | HTML 5, English |
10657 | reviews.cnet.com | 12424 | 5.03 | 200 | HTML 5, English |
10658 | tisch.nyu.edu | 12425 | 5.03 | 200 | HTML 5, No Lang |
10659 | godoc.org | 12426 | 5.03 | 200 | HTML 5, English |
10660 | plan.io | 12427 | 5.03 | 200 | HTML 5, English |
10661 | sipri.org | 12428 | 5.03 | 200 | HTML 5, English |
10662 | admin.google.com | 12429 | 5.03 | 200 | HTML 5, English |
10663 | leg.state.nv.us | 12430 | 5.03 | 200 | HTML 5, English |
10664 | skoda-storyboard.com | 12431 | 5.03 | 200 | HTML 5, English |
10665 | ifthenpay.com | 12432 | 5.03 | 200 | HTML 5 |
10666 | nissan.co.uk | 12433 | 5.03 | 200 | HTML 5, English |
10667 | signup.live.com | 12434 | 5.03 | 200 | English |
10668 | teamtreehouse.com | 12435 | 5.03 | 200 | HTML 5, English |
10669 | moca.org | 12436 | 5.03 | 200 | HTML 5, English |
10670 | splitbrain.org | 12437 | 5.03 | 200 | HTML 5, English |
10671 | pirelli.com | 12438 | 5.03 | 200 | HTML 5, No Lang |
10672 | tallahassee.com | 12439 | 5.03 | 200 | HTML 5, English |
10673 | fupa.net | 12440 | 5.03 | 200 | HTML 5 |
10674 | search.com | 12443 | 5.03 | 200 | HTML 5, English |
10675 | docs.opensea.io | 12444 | 5.03 | 200 | HTML 5, English |
10676 | bmcbioinformatics.biomedcentral.com | 12445 | 5.03 | 200 | HTML 5, English |
10677 | thelocal.de | 12446 | 5.03 | 200 | HTML 5, English |
10678 | protobuf.dev | 12447 | 5.03 | 200 | HTML 5, English |
10679 | adoptapet.com | 12448 | 5.03 | 200 | HTML 5, English |
10680 | mochajs.org | 12449 | 5.03 | 200 | HTML 5, English |
10681 | fau.edu | 12450 | 5.03 | 200 | HTML 5, English |
10682 | passlogy.com | 12451 | 5.03 | 200 | HTML 5 |
10683 | veracode.com | 12452 | 5.03 | 200 | HTML 5, English |
10684 | xxx.com | 12453 | 5.03 | 200 | No Lang, Transitional |
10685 | people.duke.edu | 12455 | 5.03 | 200 | No Lang, Transitional |
10686 | weather.yahoo.co.jp | 12456 | 5.03 | 200 | HTML 5 |
10687 | lojadomecanico.com.br | 12457 | 5.03 | 200 | HTML 5 |
10688 | arageek.com | 12458 | 5.03 | 200 | HTML 5 |
10689 | justintimberlake.com | 12459 | 5.03 | 200 | HTML 5, English |
10690 | relay.fm | 12461 | 5.03 | 200 | HTML 5, English |
10691 | aau.at | 12462 | 5.03 | 200 | HTML 5 |
10692 | daad.de | 12463 | 5.03 | 200 | HTML 5 |
10693 | linfo.org | 12464 | 5.03 | 200 | No Lang, Transitional |
10694 | remitano.com | 12465 | 5.03 | 200 | HTML 5, English |
10695 | yok.gov.tr | 12466 | 5.03 | 200 | |
10696 | tvazteca.com | 12467 | 5.03 | 200 | HTML 5 |
10697 | upv.es | 12469 | 5.03 | 200 | HTML 5, English |
10698 | 2.gravatar.com | 12470 | 5.03 | 200 | HTML 5, English |
10699 | hastebin.com | 12471 | 5.03 | 200 | HTML 5, English |
10700 | us2.campaign-archive1.com | 12472 | 5.03 | 200 | No Lang |
Data from: Open PageRank