Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19601 | en.wikinews.org | 22871 | 4.86 | 200 | HTML 5, No Lang |
19602 | mtel.ba | 22872 | 4.86 | 200 | Transitional |
19603 | sports.betmgm.com | 22873 | 4.86 | 200 | HTML 5, English |
19604 | dlvr.it | 22874 | 4.86 | 200 | HTML 5, English |
19605 | wppusher.com | 22875 | 4.86 | 200 | HTML 5, English |
19606 | scrumguides.org | 22877 | 4.86 | 200 | HTML 5, English |
19607 | oehha.ca.gov | 22878 | 4.86 | 200 | No Lang |
19608 | sportsnaut.com | 22879 | 4.86 | 200 | HTML 5, English |
19609 | news.cs.washington.edu | 22880 | 4.86 | 200 | HTML 5, English |
19610 | socket.io | 22881 | 4.86 | 200 | HTML 5, English |
19611 | woodtv.com | 22882 | 4.86 | 200 | HTML 5, English |
19612 | alvarotrigo.com | 22883 | 4.86 | 200 | HTML 5, No Lang |
19613 | biteable.com | 22884 | 4.86 | 200 | HTML 5, English |
19614 | get.fabric.io | 22885 | 4.86 | 200 | HTML 5, English |
19615 | pixai.art | 22886 | 4.86 | 200 | HTML 5, English |
19616 | dradio.de | 22887 | 4.86 | 200 | HTML 5 |
19617 | climatewatchdata.org | 22889 | 4.86 | 200 | HTML 5, No Lang |
19618 | kcrg.com | 22890 | 4.86 | 200 | HTML 5, English |
19619 | hort.purdue.edu | 22891 | 4.86 | 200 | HTML 5, English |
19620 | tacc.utexas.edu | 22892 | 4.86 | 200 | HTML 5, English |
19621 | igem.org | 22893 | 4.86 | 200 | HTML 5, English |
19622 | wareable.com | 22894 | 4.86 | 200 | HTML 5, English |
19623 | my.mollie.com | 22897 | 4.86 | 200 | HTML 5, English |
19624 | www-304.ibm.com | 22898 | 4.84 | 200 | HTML 5, English |
19625 | marketinginsidergroup.com | 22899 | 4.84 | 200 | HTML 5, English |
19626 | metrotrains.com.au | 22900 | 4.84 | 200 | HTML 5, English |
19627 | jeffgeerling.com | 22901 | 4.84 | 200 | HTML 5, English |
19628 | wandb.ai | 22903 | 4.84 | 200 | HTML 5, English |
19629 | rstyle.me | 22904 | 4.84 | 200 | HTML 5, English |
19630 | usal.es | 22906 | 4.84 | 200 | |
19631 | csclub.uwaterloo.ca | 22907 | 4.84 | 200 | HTML 5, English |
19632 | able2know.org | 22908 | 4.84 | 200 | English, Strict |
19633 | alanis.com | 22909 | 4.84 | 200 | HTML 5, English |
19634 | lematin.ch | 22910 | 4.84 | 200 | HTML 5 |
19635 | us.jll.com | 22911 | 4.84 | 200 | HTML 5, English |
19636 | affise.com | 22912 | 4.84 | 200 | HTML 5, English |
19637 | cityroom.blogs.nytimes.com | 22914 | 4.84 | 200 | HTML 5, English |
19638 | blood.ca | 22915 | 4.84 | 200 | HTML 5, English |
19639 | revistaforum.com.br | 22916 | 4.84 | 200 | HTML 5 |
19640 | glasswire.com | 22917 | 4.84 | 200 | HTML 5, No Lang |
19641 | blog.dhimmel.com | 22918 | 4.84 | 200 | HTML 5, English |
19642 | sebastiendumont.com | 22919 | 4.84 | 200 | HTML 5, English |
19643 | vrs.de | 22920 | 4.84 | 200 | HTML 5 |
19644 | gardeningknowhow.com | 22921 | 4.84 | 200 | HTML 5, English |
19645 | themecentury.com | 22922 | 4.84 | 200 | HTML 5, English |
19646 | developer.ubuntu.com | 22923 | 4.84 | 200 | HTML 5, English |
19647 | dspguide.com | 22924 | 4.84 | 200 | No Lang, Transitional |
19648 | simbad.u-strasbg.fr | 22925 | 4.84 | 200 | No Lang, Transitional |
19649 | nu.or.id | 22926 | 4.84 | 200 | HTML 5 |
19650 | desura.com | 22927 | 4.84 | 200 | HTML 5, English |
19651 | stephanventer.com | 22928 | 4.84 | 200 | HTML 5, No Lang |
19652 | mentalhealth.gov | 22930 | 4.84 | 200 | HTML 5, English |
19653 | bacontoday.com | 22931 | 4.84 | 200 | HTML 5, English |
19654 | 6sqft.com | 22933 | 4.84 | 200 | HTML 5, English |
19655 | tedmed.com | 22934 | 4.84 | 200 | HTML 5, No Lang |
19656 | luas.ie | 22935 | 4.84 | 200 | HTML 5, English |
19657 | bitcoincore.org | 22936 | 4.84 | 200 | HTML 5, English |
19658 | frictionalgames.com | 22937 | 4.84 | 200 | HTML 5, English |
19659 | nmm.nl | 22938 | 4.84 | 200 | HTML 5 |
19660 | fvsu.edu | 22939 | 4.84 | 200 | HTML 5, English |
19661 | dced.pa.gov | 22940 | 4.84 | 200 | HTML 5, English |
19662 | youmagine.com | 22941 | 4.84 | 200 | HTML 5, English |
19663 | sticker.ly | 22943 | 4.84 | 200 | HTML 5 |
19664 | articulo.mercadolibre.com.mx | 22946 | 4.84 | 200 | HTML 5 |
19665 | inpsyde.com | 22947 | 4.84 | 200 | HTML 5, English |
19666 | plugins.matomo.org | 22948 | 4.84 | 200 | HTML 5, English |
19667 | acko.net | 22950 | 4.84 | 200 | HTML 5, No Lang |
19668 | us17.campaign-archive.com | 22951 | 4.84 | 200 | No Lang |
19669 | salzwelten.at | 22952 | 4.84 | 200 | HTML 5, English |
19670 | firstcoastnews.com | 22954 | 4.84 | 200 | HTML 5, English |
19671 | outdoors.com | 22955 | 4.84 | 200 | HTML 5, English |
19672 | phl17.com | 22956 | 4.84 | 200 | HTML 5, English |
19673 | 4h.ucanr.edu | 22957 | 4.84 | 200 | HTML 5, English |
19674 | simpleicon.com | 22958 | 4.84 | 200 | HTML 5, No Lang |
19675 | acehotel.com | 22960 | 4.84 | 200 | HTML 5, English |
19676 | themeboy.com | 22961 | 4.84 | 200 | HTML 5, English |
19677 | augusta.edu | 22962 | 4.84 | 200 | HTML 5, English |
19678 | generalmills.com | 22963 | 4.84 | 200 | HTML 5, No Lang |
19679 | wcs.org | 22964 | 4.84 | 200 | HTML 5, English |
19680 | 3ammagazine.com | 22965 | 4.84 | 200 | HTML 5, English |
19681 | wilton.com | 22966 | 4.84 | 200 | HTML 5, English |
19682 | linguisticsociety.org | 22967 | 4.84 | 200 | HTML 5, No Lang |
19683 | cska-hockey.ru | 22969 | 4.84 | 200 | HTML 5 |
19684 | piratenpartei.de | 22970 | 4.84 | 200 | HTML 5 |
19685 | dvv.fi | 22971 | 4.84 | 200 | HTML 5 |
19686 | hanshin.co.jp | 22972 | 4.84 | 200 | HTML 5 |
19687 | blog.zoom.us | 22973 | 4.84 | 200 | HTML 5, English |
19688 | cs231n.stanford.edu | 22974 | 4.84 | 200 | HTML 5, English |
19689 | ch.nicovideo.jp | 22975 | 4.84 | 200 | No Lang, Transitional |
19690 | bri.co.id | 22976 | 4.84 | 200 | HTML 5 |
19691 | bugs.kde.org | 22977 | 4.84 | 200 | HTML 5, English |
19692 | web-profile.net | 22978 | 4.84 | 200 | HTML 5, English |
19693 | ok-magazin.de | 22979 | 4.84 | 200 | HTML 5 |
19694 | bqworks.net | 22981 | 4.84 | 200 | HTML 5, No Lang |
19695 | truth-out.org | 22982 | 4.84 | 200 | HTML 5, English |
19696 | fespa.com | 22983 | 4.84 | 200 | HTML 5, English |
19697 | business.gov.au | 22984 | 4.84 | 200 | HTML 5, English |
19698 | communities.intel.com | 22986 | 4.84 | 200 | HTML 5, English |
19699 | skyatnightmagazine.com | 22987 | 4.84 | 200 | HTML 5, English |
19700 | mila.quebec | 22988 | 4.84 | 200 | HTML 5, English |
Data from: Open PageRank