Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is prioritized by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and is not in the list!
- It doesn't get much traffic, but not even in the top 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains in the list are naked domains, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com; one should redirect to the other.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without disabling SSL verification
- With standard security checks, clients will never get the redirect
- Some domains redirect from behind an invalid SSL cert
- This is super easy to fix, or to park the domain so it handles redirects.
- That is lost revenue, given how much linking it takes to get on this list
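
The naked-domain observations above can be reproduced with a small probe. This is a minimal sketch using only the Python standard library, not the crawler actually used here; the helper names `candidate_urls` and `probe` and the result strings are made up for illustration. It requests both the naked and the `www` form with certificate verification left on, so an invalid cert shows up as an SSL error instead of a redirect.

```python
import ssl
import urllib.error
import urllib.request


def candidate_urls(domain):
    """Return the naked and www HTTPS URLs for a listed domain (hypothetical helper)."""
    return [f"https://{domain}/", f"https://www.{domain}/"]


def probe(url, timeout=10.0):
    """Fetch a URL with standard certificate checks and summarize the outcome."""
    ctx = ssl.create_default_context()  # verification stays enabled
    try:
        with urllib.request.urlopen(url, timeout=timeout, context=ctx) as resp:
            # geturl() is the final URL after any redirects were followed
            return f"ok {resp.status} -> {resp.geturl()}"
    except urllib.error.HTTPError as e:
        return f"http {e.code}"
    except urllib.error.URLError as e:
        if isinstance(e.reason, ssl.SSLError):
            # Bad cert: a normal client stops here and never sees the redirect
            return "ssl error"
        return f"unreachable: {e.reason}"  # e.g. no DNS entry for the naked domain
    except OSError as e:
        return f"unreachable: {e}"  # timeouts and other socket-level failures


if __name__ == "__main__":
    for url in candidate_urls("example.com"):
        print(url, probe(url))
```

Comparing the two results per domain separates the cases noted above: no DNS entry for the naked domain, an SSL error on the naked domain only, or a clean redirect.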
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
11601 | adrianroselli.com | 13517 | 5.00 | 200 | HTML 5, English |
11602 | afi.com | 13519 | 5.00 | 200 | HTML 5, English |
11603 | extratv.com | 13520 | 5.00 | 200 | HTML 5, English |
11604 | agora.xtec.cat | 13522 | 5.00 | 200 | HTML 5 |
11605 | pubpub.org | 13523 | 5.00 | 200 | HTML 5, English |
11606 | banque-france.fr | 13524 | 5.00 | 200 | HTML 5 |
11607 | news.shopify.com | 13525 | 5.00 | 200 | HTML 5, English |
11608 | heavens-above.com | 13526 | 5.00 | 200 | HTML 5, English |
11609 | bradenton.com | 13527 | 5.00 | 200 | HTML 5, English |
11610 | akaunting.com | 13528 | 5.00 | 200 | HTML 5, English |
11611 | blog.scoutingmagazine.org | 13529 | 5.00 | 200 | HTML 5, English |
11612 | gemalto.com | 13530 | 5.00 | 200 | HTML 5, English |
11613 | asiae.co.kr | 13531 | 5.00 | 200 | HTML 5 |
11614 | articulo.mercadolibre.com.ar | 13532 | 5.00 | 200 | HTML 5 |
11615 | podcast.ausha.co | 13533 | 5.00 | 200 | HTML 5, English |
11616 | poket.com | 13534 | 5.00 | 200 | HTML 5, English |
11617 | restaurantguru.com | 13535 | 5.00 | 200 | HTML 5, English |
11618 | jimmyjohns.com | 13536 | 5.00 | 200 | HTML 5, English |
11619 | amsterdam-dance-event.nl | 13537 | 5.00 | 200 | HTML 5, English |
11620 | vk.me | 13539 | 5.00 | 200 | HTML 5, English |
11621 | simplified.com | 13540 | 5.00 | 200 | HTML 5, English |
11622 | codeless.co | 13541 | 5.00 | 200 | HTML 5, English |
11623 | uxmatters.com | 13543 | 5.00 | 200 | HTML 5, No Lang |
11624 | memphistravel.com | 13546 | 5.00 | 200 | HTML 5, English |
11625 | master-addons.com | 13547 | 5.00 | 200 | HTML 5, English |
11626 | careem.com | 13548 | 5.00 | 200 | HTML 5, English |
11627 | 7span.com | 13549 | 5.00 | 200 | HTML 5, English |
11628 | apraamcos.com.au | 13550 | 5.00 | 200 | HTML 5, English |
11629 | poliziadistato.it | 13551 | 5.00 | 200 | HTML 5 |
11630 | animalnewyork.com | 13552 | 5.00 | 200 | HTML 5, English |
11631 | it-daily.net | 13553 | 5.00 | 200 | HTML 5 |
11632 | aaa.si.edu | 13554 | 5.00 | 200 | HTML 5, English |
11633 | google.is | 13555 | 5.00 | 200 | HTML 5, English |
11634 | bruegel.org | 13556 | 5.00 | 200 | HTML 5, English |
11635 | miaminewtimes.com | 13557 | 5.00 | 200 | HTML 5, English |
11636 | colororacle.org | 13559 | 5.00 | 200 | No Lang, Strict |
11637 | hackage.haskell.org | 13560 | 5.00 | 200 | HTML 5, No Lang |
11638 | libertymutual.com | 13561 | 5.00 | 200 | HTML 5, English |
11639 | sixthtone.com | 13563 | 5.00 | 200 | HTML 5, No Lang |
11640 | freenode.net | 13564 | 5.00 | 200 | No Lang |
11641 | 4chan.org | 13565 | 5.00 | 200 | No Lang, Strict |
11642 | fiserv.com | 13566 | 5.00 | 200 | HTML 5, English |
11643 | aeroportodinapoli.it | 13567 | 5.00 | 200 | HTML 5 |
11644 | dfrobot.com | 13568 | 5.00 | 200 | HTML 5, English |
11645 | pib.gov.in | 13569 | 5.00 | 200 | HTML 5, No Lang |
11646 | bankwest.com.au | 13570 | 5.00 | 200 | No Lang |
11647 | msri.org | 13571 | 5.00 | 200 | HTML 5, English |
11648 | finder.com.au | 13572 | 5.00 | 200 | HTML 5, English |
11649 | shotcut.org | 13573 | 5.00 | 200 | HTML 5, No Lang |
11650 | theprint.in | 13574 | 5.00 | 200 | English |
11651 | elleshop.jp | 13575 | 5.00 | 200 | HTML 5 |
11652 | njleg.state.nj.us | 13577 | 5.00 | 200 | HTML 5, English |
11653 | clickmeeting.com | 13578 | 5.00 | 200 | HTML 5, English |
11654 | ktsm.com | 13579 | 5.00 | 200 | HTML 5, English |
11655 | ratemyprofessors.com | 13580 | 5.00 | 200 | No Lang |
11656 | shadertoy.com | 13581 | 5.00 | 200 | HTML 5, English |
11657 | glose.com | 13582 | 5.00 | 200 | HTML 5, No Lang |
11658 | soccerway.com | 13583 | 5.00 | 200 | HTML 5, English |
11659 | seco.admin.ch | 13584 | 5.00 | 200 | HTML 5 |
11660 | byu.edu | 13585 | 5.00 | 200 | HTML 5, English |
11661 | unicef.org.uk | 13586 | 5.00 | 200 | HTML 5, English |
11662 | bugsnag.com | 13587 | 5.00 | 200 | HTML 5, No Lang |
11663 | vodpod.com | 13588 | 5.00 | 200 | HTML 5, English |
11664 | jchs.harvard.edu | 13589 | 5.00 | 200 | HTML 5, English |
11665 | globalwitness.org | 13590 | 5.00 | 200 | HTML 5, English |
11666 | mariinsky.ru | 13591 | 5.00 | 200 | HTML 5 |
11667 | jornaleconomico.sapo.pt | 13592 | 5.00 | 200 | HTML 5 |
11668 | esewa.com.np | 13593 | 5.00 | 200 | HTML 5, English |
11669 | epaka.pl | 13594 | 5.00 | 200 | HTML 5 |
11670 | clubic.com | 13595 | 5.00 | 200 | HTML 5 |
11671 | stitchfix.com | 13596 | 5.00 | 200 | HTML 5, English |
11672 | artnet.com | 13598 | 5.00 | 200 | No Lang |
11673 | environment.data.gov.uk | 13599 | 5.00 | 200 | HTML 5, English |
11674 | safaribooksonline.com | 13600 | 5.00 | 200 | HTML 5, English |
11675 | starship.xyz | 13601 | 5.00 | 200 | HTML 5, English |
11676 | pimkie.fr | 13602 | 5.00 | 200 | HTML 5 |
11677 | samsungsds.com | 13603 | 5.00 | 200 | English |
11678 | usopen.org | 13604 | 5.00 | 200 | HTML 5, No Lang |
11679 | bukalapak.com | 13605 | 5.00 | 200 | HTML 5, No Lang |
11680 | icesi.edu.co | 13606 | 5.00 | 200 | HTML 5 |
11681 | computerlanguage.com | 13608 | 5.00 | 200 | No Lang |
11682 | downloads.wordpress.org | 13609 | 5.00 | 200 | HTML 5, English |
11683 | heathrow.com | 13611 | 5.00 | 200 | HTML 5, English |
11684 | mea.gov.in | 13612 | 5.00 | 200 | HTML 5, English |
11685 | flip.it | 13613 | 5.00 | 200 | HTML 5, English |
11686 | inf.ethz.ch | 13614 | 5.00 | 200 | HTML 5, English |
11687 | food.ndtv.com | 13615 | 5.00 | 200 | HTML 5, No Lang |
11688 | newsinfo.inquirer.net | 13617 | 5.00 | 200 | HTML 5, English |
11689 | osm.org | 13618 | 5.00 | 200 | HTML 5, English |
11690 | paysafe.com | 13620 | 5.00 | 200 | HTML 5, English |
11691 | criptonoticias.com | 13622 | 5.00 | 200 | HTML 5 |
11692 | riteaid.com | 13623 | 5.00 | 200 | HTML 5, English |
11693 | fff.fr | 13624 | 5.00 | 200 | HTML 5, No Lang |
11694 | issuewire.com | 13625 | 5.00 | 200 | HTML 5, English |
11695 | styleblueprint.com | 13626 | 5.00 | 200 | HTML 5, English |
11696 | hinduismtoday.com | 13627 | 5.00 | 200 | HTML 5, English |
11697 | roymorgan.com | 13628 | 5.00 | 200 | HTML 5, No Lang |
11698 | ifc.com | 13632 | 5.00 | 200 | HTML 5, English |
11699 | thesession.org | 13633 | 5.00 | 200 | HTML 5, English |
11700 | thefashionspot.com | 13634 | 5.00 | 200 | HTML 5, English |
Data from: Open PageRank
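
The Flags column could be derived from each home page with a few pattern checks. The sketch below is one guess at such rules, not the logic that produced this table: the function name `classify` and the exact conditions (HTML 5 means a bare `<!DOCTYPE html>`, Strict means the doctype mentions a strict DTD, English means a `lang` value of `en` or `en-*`) are assumptions.

```python
import re

# Assumed patterns; a real crawler would more likely use an HTML parser.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# The lookbehind skips xml:lang and attributes such as data-lang
LANG_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z0-9-]+)""", re.IGNORECASE)


def classify(html):
    """Return a list of flags for a page, e.g. ["HTML 5", "English"]."""
    flags = []

    m = DOCTYPE_RE.search(html)
    doctype = m.group(1).strip().lower() if m else ""
    if doctype == "html":
        flags.append("HTML 5")

    tag = HTML_TAG_RE.search(html)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    else:
        value = lang.group(1).lower()
        if value == "en" or value.startswith("en-"):
            flags.append("English")
        # Other languages get no flag in this sketch

    if "strict" in doctype:
        flags.append("Strict")
    return flags


if __name__ == "__main__":
    print(classify('<!DOCTYPE html><html lang="en-US"><head></head></html>'))
```

Under these assumed rules, a page with a valid HTML 5 doctype and a non-English `lang` value shows only the HTML 5 flag.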