Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
6601 | e15.cz | 7740 | 5.19 | 200 | HTML 5 |
6602 | timeslive.co.za | 7741 | 5.19 | 200 | HTML 5, English |
6603 | craftsy.com | 7742 | 5.19 | 200 | HTML 5, English |
6604 | dbs.com | 7743 | 5.19 | 200 | HTML 5, No Lang |
6605 | qonto.com | 7744 | 5.19 | 200 | HTML 5, English |
6606 | intermountainhealthcare.org | 7746 | 5.19 | 200 | HTML 5, English |
6607 | static.addtoany.com | 7747 | 5.19 | 200 | No Lang |
6608 | mattermost.com | 7748 | 5.19 | 200 | HTML 5, English |
6609 | pixelfed.org | 7749 | 5.19 | 200 | HTML 5, English |
6610 | sanluisobispo.com | 7750 | 5.19 | 200 | HTML 5, English |
6611 | jwst.nasa.gov | 7751 | 5.19 | 200 | HTML 5, English |
6612 | koreaboo.com | 7752 | 5.19 | 200 | HTML 5, English |
6613 | news.adobe.com | 7753 | 5.19 | 200 | HTML 5, No Lang |
6614 | sunoutdoors.com | 7754 | 5.19 | 200 | HTML 5, English |
6615 | lbank.com | 7755 | 5.19 | 200 | HTML 5, English |
6616 | translate.wordpress.org | 7756 | 5.19 | 200 | HTML 5, English |
6617 | simplemaps.com | 7757 | 5.19 | 200 | HTML 5, English |
6618 | kansascity.com | 7758 | 5.19 | 200 | HTML 5, English |
6619 | fibre2fashion.com | 7759 | 5.19 | 200 | HTML 5, English |
6620 | iii.org | 7760 | 5.19 | 200 | English |
6621 | crocoblock.com | 7761 | 5.19 | 200 | HTML 5, English |
6622 | koreatimes.co.kr | 7762 | 5.19 | 200 | No Lang |
6623 | insights.sei.cmu.edu | 7763 | 5.19 | 200 | HTML 5, English |
6624 | pehub.com | 7764 | 5.19 | 200 | HTML 5, English |
6625 | eacea.ec.europa.eu | 7765 | 5.19 | 200 | English, Transitional |
6626 | rent.com | 7766 | 5.19 | 200 | HTML 5, English |
6627 | webrtc.org | 7767 | 5.19 | 200 | HTML 5, English |
6628 | health.nsw.gov.au | 7768 | 5.19 | 200 | HTML 5, English |
6629 | rode.com | 7770 | 5.19 | 200 | HTML 5, English |
6630 | podio.com | 7771 | 5.19 | 200 | HTML 5, English |
6631 | nutshell.com | 7772 | 5.19 | 200 | HTML 5, English |
6632 | perkins.org | 7773 | 5.19 | 200 | HTML 5, English |
6633 | nozbe.com | 7774 | 5.19 | 200 | HTML 5, English |
6634 | nick.com | 7775 | 5.19 | 200 | HTML 5, English |
6635 | aalto.fi | 7776 | 5.19 | 200 | HTML 5 |
6636 | gulfnews.com | 7778 | 5.19 | 200 | HTML 5, English |
6637 | gu.se | 7779 | 5.19 | 200 | HTML 5 |
6638 | flathub.org | 7780 | 5.19 | 200 | HTML 5, English |
6639 | prada.com | 7781 | 5.19 | 200 | HTML 5, English |
6640 | english.elpais.com | 7782 | 5.19 | 200 | HTML 5, English |
6641 | t.snapchat.com | 7783 | 5.19 | 200 | HTML 5, English |
6642 | uk.news.yahoo.com | 7784 | 5.19 | 200 | HTML 5, No Lang |
6643 | ellenmacarthurfoundation.org | 7785 | 5.19 | 200 | HTML 5, English |
6644 | anandtech.com | 7786 | 5.19 | 200 | HTML 5, No Lang |
6645 | ga.gov.au | 7787 | 5.19 | 200 | HTML 5, English |
6646 | soundonsound.com | 7788 | 5.19 | 200 | HTML 5, English |
6647 | urlencoder.org | 7789 | 5.19 | 200 | HTML 5, English |
6648 | loblaw.ca | 7790 | 5.19 | 200 | HTML 5, No Lang |
6649 | alternet.org | 7791 | 5.19 | 200 | HTML 5, English |
6650 | nrc.gov | 7792 | 5.19 | 200 | HTML 5, English |
6651 | xunta.gal | 7793 | 5.19 | 200 | HTML 5 |
6652 | alpinelinux.org | 7794 | 5.19 | 200 | HTML 5, English |
6653 | dealdash.com | 7795 | 5.19 | 200 | HTML 5, English |
6654 | afsp.org | 7797 | 5.18 | 200 | HTML 5, English |
6655 | pasteboard.co | 7798 | 5.18 | 200 | HTML 5, No Lang |
6656 | archello.com | 7799 | 5.18 | 200 | HTML 5, English |
6657 | 24ways.org | 7800 | 5.18 | 200 | HTML 5, English |
6658 | elisa.fi | 7801 | 5.18 | 200 | HTML 5 |
6659 | ufmg.br | 7802 | 5.18 | 200 | HTML 5 |
6660 | mail.com | 7807 | 5.18 | 200 | HTML 5, English |
6661 | genomebiology.biomedcentral.com | 7808 | 5.18 | 200 | HTML 5, English |
6662 | blogs.technet.microsoft.com | 7809 | 5.18 | 200 | HTML 5, English |
6663 | betterexplained.com | 7810 | 5.18 | 200 | HTML 5, English |
6664 | provenexpert.com | 7811 | 5.18 | 200 | HTML 5, English |
6665 | tagesanzeiger.ch | 7812 | 5.18 | 200 | HTML 5 |
6666 | hydroquebec.com | 7813 | 5.18 | 200 | No Lang, Transitional |
6667 | calpoly.edu | 7814 | 5.18 | 200 | HTML 5, English |
6668 | rm.coe.int | 7815 | 5.18 | 200 | No Lang |
6669 | global.jaxa.jp | 7816 | 5.18 | 200 | English, Transitional |
6670 | pantheon.io | 7817 | 5.18 | 200 | HTML 5, English |
6671 | fontsinuse.com | 7818 | 5.18 | 200 | HTML 5, English |
6672 | vincecamuto.com | 7819 | 5.18 | 200 | HTML 5, English |
6673 | app.swaggerhub.com | 7820 | 5.18 | 200 | HTML 5, English |
6674 | mises.org | 7821 | 5.18 | 200 | HTML 5, English |
6675 | shortform.com | 7822 | 5.18 | 200 | HTML 5, English |
6676 | riverfronttimes.com | 7823 | 5.18 | 200 | HTML 5, English |
6677 | english.visitkorea.or.kr | 7824 | 5.18 | 200 | HTML 5, English |
6678 | databox.com | 7825 | 5.18 | 200 | HTML 5, English |
6679 | cope.es | 7826 | 5.18 | 200 | HTML 5 |
6680 | blogs.plos.org | 7827 | 5.18 | 200 | HTML 5, English |
6681 | uts.edu.au | 7829 | 5.18 | 200 | HTML 5, English |
6682 | looker.com | 7830 | 5.18 | 200 | HTML 5, English |
6683 | webcitation.org | 7831 | 5.18 | 200 | English, Strict |
6684 | offbeat.com | 7832 | 5.18 | 200 | HTML 5, English |
6685 | diyphotography.net | 7833 | 5.18 | 200 | HTML 5, English |
6686 | firstmonday.org | 7834 | 5.18 | 200 | HTML 5, English |
6687 | sandia.gov | 7835 | 5.18 | 200 | HTML 5, English |
6688 | llnl.gov | 7837 | 5.18 | 200 | HTML 5, English |
6689 | owletcare.com | 7839 | 5.18 | 200 | HTML 5, English |
6690 | ga-dev-tools.appspot.com | 7841 | 5.18 | 200 | HTML 5, No Lang |
6691 | informationisbeautiful.net | 7842 | 5.18 | 200 | HTML 5, English |
6692 | globalbankingandfinance.com | 7843 | 5.18 | 200 | HTML 5, English |
6693 | thepaypers.com | 7844 | 5.18 | 200 | English, Transitional |
6694 | couchsurfing.com | 7845 | 5.18 | 200 | HTML 5, English |
6695 | guardianproject.info | 7846 | 5.18 | 200 | HTML 5, English |
6696 | gem.godaddy.com | 7848 | 5.18 | 200 | HTML 5, English |
6697 | agora.io | 7849 | 5.18 | 200 | HTML 5, English |
6698 | feedster.com | 7850 | 5.18 | 200 | HTML 5, No Lang |
6699 | greentechmedia.com | 7851 | 5.18 | 200 | HTML 5, No Lang |
6700 | html5test.com | 7854 | 5.18 | 200 | HTML 5, English |
Data from: Open PageRank