Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
17601 | infolific.com | 20532 | 4.88 | 200 | English, Transitional |
17602 | afrotech.com | 20533 | 4.88 | 200 | HTML 5, English |
17603 | wiki.hyperledger.org | 20535 | 4.88 | 200 | HTML 5, No Lang |
17604 | evolution.berkeley.edu | 20536 | 4.88 | 200 | HTML 5, English |
17605 | mc4wp.com | 20537 | 4.88 | 200 | HTML 5, English |
17606 | klick-tipp.com | 20538 | 4.88 | 200 | HTML 5 |
17607 | sonntagsblatt.de | 20539 | 4.88 | 200 | HTML 5 |
17608 | oschina.net | 20540 | 4.88 | 200 | HTML 5 |
17609 | sciencelearn.org.nz | 20541 | 4.88 | 200 | HTML 5, English |
17610 | ubergizmo.com | 20542 | 4.88 | 200 | HTML 5, English |
17611 | eltonjohn.com | 20543 | 4.88 | 200 | HTML 5, English |
17612 | torontozoo.com | 20544 | 4.88 | 200 | HTML 5, English |
17613 | cmsimpact.org | 20545 | 4.88 | 200 | HTML 5, English |
17614 | pinterest.se | 20546 | 4.88 | 200 | HTML 5, English |
17615 | snf.ch | 20547 | 4.88 | 200 | HTML 5 |
17616 | v2ex.com | 20548 | 4.88 | 200 | HTML 5 |
17617 | ecrater.com | 20549 | 4.88 | 200 | HTML 5, English |
17618 | caregiver.org | 20550 | 4.88 | 200 | HTML 5, English |
17619 | cbsa-asfc.gc.ca | 20551 | 4.88 | 200 | HTML 5, English |
17620 | whonix.org | 20552 | 4.88 | 200 | HTML 5, English |
17621 | psc.edu | 20553 | 4.88 | 200 | HTML 5, English |
17622 | microcontentnews.com | 20554 | 4.88 | 200 | HTML 5 |
17623 | gordonramsayrestaurants.com | 20556 | 4.88 | 200 | HTML 5, English |
17624 | yourshot.nationalgeographic.com | 20558 | 4.88 | 200 | HTML 5, English |
17625 | letv.com | 20559 | 4.88 | 200 | HTML 5, No Lang |
17626 | bulkwp.com | 20560 | 4.88 | 200 | HTML 5, English |
17627 | blog.mozilla.com | 20562 | 4.88 | 200 | HTML 5, English |
17628 | cogeo.org | 20563 | 4.88 | 200 | HTML 5, No Lang |
17629 | canterbury.ac.nz | 20564 | 4.88 | 200 | HTML 5, English |
17630 | cultureamp.com | 20565 | 4.88 | 200 | HTML 5, English |
17631 | unifr.ch | 20566 | 4.88 | 200 | No Lang |
17632 | knitty.com | 20567 | 4.88 | 200 | HTML 5, English |
17633 | translate.google.co.jp | 20568 | 4.88 | 200 | HTML 5 |
17634 | washingtonwine.org | 20569 | 4.88 | 200 | HTML 5, English |
17635 | wtkr.com | 20570 | 4.88 | 200 | HTML 5, English |
17636 | optiv.com | 20571 | 4.88 | 200 | HTML 5, English |
17637 | sciencecommons.org | 20572 | 4.88 | 200 | HTML 5, English |
17638 | helmholtz.de | 20573 | 4.88 | 200 | HTML 5 |
17639 | iuhealth.org | 20574 | 4.88 | 200 | HTML 5, English |
17640 | tfm.co.jp | 20575 | 4.88 | 200 | HTML 5 |
17641 | members.aol.com | 20576 | 4.88 | 200 | HTML 5, English |
17642 | soapui.org | 20577 | 4.88 | 200 | HTML 5, No Lang |
17643 | zoop.com.br | 20578 | 4.88 | 200 | HTML 5 |
17644 | hotpepper.jp | 20579 | 4.88 | 200 | Strict |
17645 | ottawa.ctvnews.ca | 20581 | 4.88 | 200 | HTML 5, English |
17646 | toogoodtogo.com | 20582 | 4.88 | 200 | HTML 5, English |
17647 | tcrf.net | 20583 | 4.88 | 200 | HTML 5, English |
17648 | mdc-berlin.de | 20584 | 4.88 | 200 | HTML 5, English |
17649 | examine.com | 20585 | 4.88 | 200 | HTML 5, English |
17650 | eab.abime.net | 20586 | 4.88 | 200 | English, Transitional |
17651 | asc-csa.gc.ca | 20587 | 4.88 | 200 | HTML 5 |
17652 | easin.jrc.ec.europa.eu | 20589 | 4.88 | 200 | HTML 5, English |
17653 | vidiq.com | 20590 | 4.88 | 200 | HTML 5, English |
17654 | tech.sina.com.cn | 20591 | 4.88 | 200 | HTML 5, No Lang |
17655 | magenta.tensorflow.org | 20592 | 4.88 | 200 | HTML 5, English |
17656 | cfo.com | 20593 | 4.88 | 200 | HTML 5, English |
17657 | operations.osmfoundation.org | 20594 | 4.88 | 200 | HTML 5, No Lang |
17658 | sip.gouvernement.lu | 20595 | 4.88 | 200 | HTML 5 |
17659 | es.wikiloc.com | 20596 | 4.88 | 200 | HTML 5 |
17660 | confidentialcomputing.io | 20597 | 4.88 | 200 | HTML 5, English |
17661 | sparkpeople.com | 20598 | 4.88 | 200 | English |
17662 | bisnow.com | 20599 | 4.88 | 200 | HTML 5, English |
17663 | warren.senate.gov | 20600 | 4.88 | 200 | HTML 5, English |
17664 | climatesafety.info | 20601 | 4.88 | 200 | HTML 5, English |
17665 | salsalabs.com | 20602 | 4.88 | 200 | HTML 5, English |
17666 | ncwit.org | 20603 | 4.88 | 200 | HTML 5, English |
17667 | geo.tv | 20604 | 4.88 | 200 | HTML 5, English |
17668 | fantacalcio.it | 20606 | 4.88 | 200 | HTML 5 |
17669 | lccn.loc.gov | 20607 | 4.88 | 200 | HTML 5, English |
17670 | attio.com | 20608 | 4.88 | 200 | HTML 5, English |
17671 | trimble.com | 20609 | 4.88 | 200 | HTML 5, No Lang |
17672 | v8.1c.ru | 20610 | 4.88 | 200 | HTML 5 |
17673 | virtuoso.openlinksw.com | 20611 | 4.88 | 200 | No Lang |
17674 | bmfsfj.de | 20612 | 4.88 | 200 | HTML 5 |
17675 | gdoc.pub | 20613 | 4.88 | 200 | HTML 5, English |
17676 | siepomaga.pl | 20614 | 4.88 | 200 | HTML 5 |
17677 | news.gatech.edu | 20615 | 4.88 | 200 | HTML 5, English |
17678 | patronite.pl | 20616 | 4.88 | 200 | HTML 5 |
17679 | mac.eltima.com | 20617 | 4.88 | 200 | HTML 5, English |
17680 | sunshinecoast.qld.gov.au | 20618 | 4.88 | 200 | HTML 5, English |
17681 | thenibble.com | 20619 | 4.88 | 200 | HTML 5, No Lang |
17682 | ge.globo.com | 20620 | 4.88 | 200 | HTML 5 |
17683 | tntdrama.com | 20621 | 4.88 | 200 | HTML 5, English |
17684 | wikiversity.org | 20622 | 4.88 | 200 | HTML 5, English |
17685 | contextis.com | 20623 | 4.88 | 200 | HTML 5, English |
17686 | divephotoguide.com | 20624 | 4.88 | 200 | HTML 5, English |
17687 | news.wttw.com | 20625 | 4.88 | 200 | HTML 5, No Lang |
17688 | webdesignerwall.com | 20626 | 4.88 | 200 | HTML 5, English |
17689 | xignite.com | 20627 | 4.88 | 200 | HTML 5, English |
17690 | newsnetwork.mayoclinic.org | 20628 | 4.88 | 200 | HTML 5, English |
17691 | melbournefc.com.au | 20629 | 4.88 | 200 | HTML 5, English |
17692 | solingen.de | 20630 | 4.88 | 200 | HTML 5 |
17693 | farm2.static.flickr.com | 20631 | 4.88 | 200 | No Lang |
17694 | lapatria.com | 20632 | 4.88 | 200 | HTML 5 |
17695 | jiocinema.com | 20633 | 4.88 | 200 | HTML 5, English |
17696 | capradio.org | 20634 | 4.88 | 200 | HTML 5, English |
17697 | snpp.com | 20635 | 4.88 | 200 | HTML 5, No Lang |
17698 | numworks.com | 20636 | 4.88 | 200 | HTML 5, English |
17699 | indieauth.net | 20637 | 4.88 | 200 | HTML 5, No Lang |
17700 | developer.tomtom.com | 20639 | 4.88 | 200 | HTML 5, No Lang |
Data from: Open PageRank