Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
7501 | rd.usda.gov | 8768 | 5.14 | 200 | HTML 5, English |
7502 | hsbc.co.uk | 8769 | 5.14 | 200 | HTML 5, English |
7503 | brain.fm | 8771 | 5.14 | 200 | HTML 5, No Lang |
7504 | daserste.de | 8772 | 5.14 | 200 | HTML 5 |
7505 | phillymag.com | 8773 | 5.14 | 200 | HTML 5, English |
7506 | intothegloss.com | 8774 | 5.14 | 200 | HTML 5, English |
7507 | cruisemapper.com | 8775 | 5.14 | 200 | HTML 5, English |
7508 | flutter.dev | 8776 | 5.14 | 200 | HTML 5, English |
7509 | education.com | 8777 | 5.14 | 200 | HTML 5, English |
7510 | milb.com | 8778 | 5.14 | 200 | HTML 5, English |
7511 | amazon.nl | 8779 | 5.14 | 200 | HTML 5 |
7512 | cs.uwaterloo.ca | 8781 | 5.14 | 200 | HTML 5, English |
7513 | food.gov.uk | 8782 | 5.14 | 200 | HTML 5, English |
7514 | browsehappy.com | 8783 | 5.14 | 200 | HTML 5, English |
7515 | web.media.mit.edu | 8784 | 5.14 | 200 | HTML 5, No Lang |
7516 | calgaryherald.com | 8785 | 5.14 | 200 | HTML 5, No Lang |
7517 | webchat.freenode.net | 8786 | 5.14 | 200 | HTML 5, No Lang |
7518 | hoopladigital.com | 8788 | 5.14 | 200 | HTML 5, English |
7519 | tmj4.com | 8789 | 5.14 | 200 | HTML 5, English |
7520 | wiki.debian.org | 8791 | 5.14 | 200 | No Lang, Strict |
7521 | surrey.ac.uk | 8792 | 5.14 | 200 | HTML 5, English |
7522 | dailygalaxy.com | 8793 | 5.14 | 200 | HTML 5, English |
7523 | arm.com | 8794 | 5.14 | 200 | HTML 5, English |
7524 | kakaocorp.com | 8795 | 5.14 | 200 | HTML 5 |
7525 | hueniverse.com | 8796 | 5.14 | 200 | HTML 5, No Lang |
7526 | london.gov.uk | 8797 | 5.14 | 200 | HTML 5, English |
7527 | nchsoftware.com | 8798 | 5.14 | 200 | English |
7528 | gosanangelo.com | 8799 | 5.14 | 200 | HTML 5, English |
7529 | thegamer.com | 8800 | 5.14 | 200 | HTML 5, English |
7530 | zenbusiness.com | 8801 | 5.14 | 200 | HTML 5, English |
7531 | thecrimson.com | 8802 | 5.14 | 200 | HTML 5, No Lang |
7532 | contently.com | 8803 | 5.14 | 200 | HTML 5, English |
7533 | space.bilibili.com | 8804 | 5.14 | 200 | HTML 5, No Lang |
7534 | gazeta.ru | 8805 | 5.14 | 200 | HTML 5 |
7535 | opovo.com.br | 8806 | 5.14 | 200 | HTML 5 |
7536 | oyez.org | 8807 | 5.14 | 200 | HTML 5, English |
7537 | scholarsarchive.byu.edu | 8808 | 5.14 | 200 | HTML 5, English |
7538 | us04web.zoom.us | 8809 | 5.14 | 200 | HTML 5, English |
7539 | meteoblue.com | 8810 | 5.14 | 200 | HTML 5, English |
7540 | tuaw.com | 8811 | 5.14 | 200 | HTML 5, English |
7541 | memberpress.com | 8812 | 5.14 | 200 | HTML 5, English |
7542 | funko.com | 8813 | 5.14 | 200 | HTML 5, English |
7543 | reutersinstitute.politics.ox.ac.uk | 8814 | 5.14 | 200 | HTML 5, English |
7544 | nicekicks.com | 8815 | 5.14 | 200 | HTML 5, English |
7545 | uxmag.com | 8816 | 5.14 | 200 | HTML 5, English |
7546 | betterhealth.vic.gov.au | 8817 | 5.14 | 200 | HTML 5, English |
7547 | cycling74.com | 8818 | 5.14 | 200 | HTML 5, English |
7548 | lwl.org | 8819 | 5.14 | 200 | HTML 5 |
7549 | publicintegrity.org | 8820 | 5.14 | 200 | HTML 5, English |
7550 | patents.justia.com | 8821 | 5.14 | 200 | HTML 5, English |
7551 | al-ain.com | 8822 | 5.14 | 200 | HTML 5 |
7552 | afro.who.int | 8823 | 5.14 | 200 | HTML 5, English |
7553 | ev.buaa.edu.cn | 8824 | 5.14 | 200 | English |
7554 | techpresident.com | 8825 | 5.14 | 200 | HTML 5, English |
7555 | indiewebcamp.com | 8826 | 5.14 | 200 | HTML 5, No Lang |
7556 | rcsb.org | 8827 | 5.14 | 200 | HTML 5, English |
7557 | lanyrd.com | 8828 | 5.14 | 200 | HTML 5, No Lang |
7558 | macupdate.com | 8829 | 5.14 | 200 | HTML 5, No Lang |
7559 | estrepublicain.fr | 8830 | 5.14 | 200 | HTML 5 |
7560 | flowpaper.com | 8831 | 5.14 | 200 | HTML 5, English |
7561 | impawards.com | 8832 | 5.14 | 200 | HTML 5, English |
7562 | letelegramme.fr | 8834 | 5.14 | 200 | HTML 5 |
7563 | momondo.com | 8835 | 5.14 | 200 | HTML 5, English |
7564 | anaconda.com | 8836 | 5.14 | 200 | HTML 5, English |
7565 | snu.ac.kr | 8837 | 5.14 | 200 | HTML 5 |
7566 | devops.com | 8839 | 5.14 | 200 | HTML 5, English |
7567 | kpn.com | 8840 | 5.14 | 200 | HTML 5 |
7568 | redcrossblood.org | 8841 | 5.14 | 200 | HTML 5, English |
7569 | stlmag.com | 8842 | 5.14 | 200 | HTML 5, English |
7570 | mail.python.org | 8844 | 5.14 | 200 | No Lang |
7571 | law.nyu.edu | 8845 | 5.14 | 200 | HTML 5, English |
7572 | distrowatch.com | 8846 | 5.14 | 200 | HTML 5, No Lang |
7573 | usu.edu | 8848 | 5.14 | 200 | HTML 5, English |
7574 | sendinblue.com | 8849 | 5.14 | 200 | HTML 5, English |
7575 | pressakey.com | 8850 | 5.14 | 200 | HTML 5 |
7576 | eca.europa.eu | 8851 | 5.14 | 200 | HTML 5, English |
7577 | tldp.org | 8852 | 5.14 | 200 | No Lang |
7578 | webtoffee.com | 8853 | 5.14 | 200 | HTML 5, English |
7579 | alumni.hbs.edu | 8854 | 5.14 | 200 | HTML 5, English |
7580 | reformjudaism.org | 8855 | 5.14 | 200 | HTML 5, English |
7581 | erc.europa.eu | 8856 | 5.14 | 200 | HTML 5, English |
7582 | inria.fr | 8857 | 5.14 | 200 | HTML 5 |
7583 | twu.edu | 8858 | 5.14 | 200 | English |
7584 | euskadi.eus | 8859 | 5.14 | 200 | HTML 5 |
7585 | bombas.com | 8860 | 5.14 | 200 | HTML 5, No Lang |
7586 | technode.com | 8861 | 5.14 | 200 | HTML 5, English |
7587 | hourofcode.com | 8863 | 5.14 | 200 | HTML 5, No Lang |
7588 | wmo.int | 8866 | 5.14 | 200 | HTML 5, English |
7589 | oireachtas.ie | 8868 | 5.14 | 200 | HTML 5, English |
7590 | thebookseller.com | 8869 | 5.14 | 200 | HTML 5, English |
7591 | torquemag.io | 8870 | 5.14 | 200 | HTML 5, English |
7592 | dhamma.org | 8871 | 5.14 | 200 | HTML 5, English |
7593 | smarthistory.org | 8874 | 5.14 | 200 | HTML 5, English |
7594 | stage32.com | 8876 | 5.14 | 200 | HTML 5, English |
7595 | anotepad.com | 8877 | 5.14 | 200 | HTML 5, English |
7596 | dunkindonuts.com | 8880 | 5.14 | 200 | HTML 5, English |
7597 | dailyrecord.co.uk | 8881 | 5.14 | 200 | HTML 5, English |
7598 | slowly.app | 8882 | 5.14 | 200 | HTML 5, English |
7599 | calltrackingmetrics.com | 8883 | 5.14 | 200 | HTML 5, English |
7600 | thehansindia.com | 8884 | 5.14 | 200 | HTML 5, English |
Data from: Open PageRank