Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
5901 | s3.eu-west-1.amazonaws.com | 6924 | 5.23 | 200 | HTML 5, English |
5902 | gitbook.com | 6927 | 5.23 | 200 | HTML 5, English |
5903 | governo.it | 6928 | 5.23 | 200 | HTML 5 |
5904 | apppresser.com | 6929 | 5.23 | 200 | HTML 5, English |
5905 | trac.ffmpeg.org | 6930 | 5.23 | 200 | HTML 5, English |
5906 | buytaert.net | 6931 | 5.23 | 200 | HTML 5, English |
5907 | flightstats.com | 6932 | 5.23 | 200 | English |
5908 | sandiego.gov | 6934 | 5.23 | 200 | HTML 5, English |
5909 | philipwalton.com | 6935 | 5.23 | 200 | HTML 5, English |
5910 | meetingorganizer.copernicus.org | 6937 | 5.23 | 200 | English, Transitional |
5911 | outsideonline.com | 6939 | 5.23 | 200 | HTML 5, English |
5912 | checkout.com | 6940 | 5.23 | 200 | HTML 5, English |
5913 | iep.utm.edu | 6941 | 5.23 | 200 | HTML 5, English |
5914 | ilovepdf.com | 6942 | 5.23 | 200 | HTML 5, English |
5915 | acronymfinder.com | 6943 | 5.23 | 200 | HTML 5, English |
5916 | designmodo.com | 6944 | 5.23 | 200 | HTML 5, English |
5917 | math.stackexchange.com | 6945 | 5.23 | 200 | HTML 5, English |
5918 | goalzero.com | 6946 | 5.23 | 200 | HTML 5, English |
5919 | spectrum.com | 6947 | 5.23 | 200 | HTML 5, English |
5920 | purina.com | 6948 | 5.23 | 200 | HTML 5, English |
5921 | lesswrong.com | 6949 | 5.23 | 200 | HTML 5, English |
5922 | elfinanciero.com.mx | 6950 | 5.23 | 200 | HTML 5 |
5923 | cityu.edu.hk | 6951 | 5.23 | 200 | HTML 5, English |
5924 | health.govt.nz | 6952 | 5.23 | 200 | HTML 5, English |
5925 | wegmans.com | 6953 | 5.23 | 200 | HTML 5, English |
5926 | vitsoe.com | 6955 | 5.23 | 200 | HTML 5, English |
5927 | vids.myspace.com | 6956 | 5.23 | 200 | HTML 5, No Lang |
5928 | asa.org.uk | 6957 | 5.23 | 200 | HTML 5, No Lang |
5929 | obi.de | 6958 | 5.23 | 200 | HTML 5 |
5930 | images.nasa.gov | 6959 | 5.23 | 200 | HTML 5, English |
5931 | fueleconomy.gov | 6960 | 5.23 | 200 | HTML 5, English |
5932 | hanselman.com | 6961 | 5.23 | 200 | HTML 5, English |
5933 | webtv.un.org | 6963 | 5.23 | 200 | HTML 5, English |
5934 | hey.com | 6964 | 5.23 | 200 | HTML 5, English |
5935 | fundable.com | 6965 | 5.23 | 200 | HTML 5, English |
5936 | daily.co.jp | 6966 | 5.23 | 200 | HTML 5 |
5937 | support.squarespace.com | 6967 | 5.23 | 200 | HTML 5, English |
5938 | sesamestreet.org | 6968 | 5.23 | 200 | HTML 5, English |
5939 | nuxtjs.org | 6969 | 5.23 | 200 | HTML 5, English |
5940 | x.org | 6970 | 5.23 | 200 | No Lang, Strict |
5941 | mylifetime.com | 6971 | 5.23 | 200 | HTML 5, English |
5942 | guggenheim.org | 6972 | 5.23 | 200 | HTML 5, English |
5943 | marc.info | 6973 | 5.23 | 200 | No Lang |
5944 | accorhotels.com | 6974 | 5.23 | 200 | HTML 5, English |
5945 | newsobserver.com | 6975 | 5.23 | 200 | HTML 5, English |
5946 | cronista.com | 6976 | 5.23 | 200 | HTML 5 |
5947 | humanesociety.org | 6977 | 5.23 | 200 | HTML 5, English |
5948 | gusto.com | 6978 | 5.23 | 200 | HTML 5, English |
5949 | docplayer.net | 6979 | 5.23 | 200 | HTML 5, English |
5950 | matrix.to | 6980 | 5.23 | 200 | HTML 5, No Lang |
5951 | earthengine.google.com | 6982 | 5.22 | 200 | HTML 5, No Lang |
5952 | write.as | 6983 | 5.22 | 200 | HTML 5, No Lang |
5953 | eprints.soton.ac.uk | 6984 | 5.22 | 200 | No Lang, Transitional |
5954 | findmypast.co.uk | 6985 | 5.22 | 200 | HTML 5, English |
5955 | tricycle.org | 6986 | 5.22 | 200 | HTML 5, English |
5956 | real.com | 6988 | 5.22 | 200 | HTML 5, English |
5957 | leanin.org | 6989 | 5.22 | 200 | HTML 5, English |
5958 | cracked.com | 6990 | 5.22 | 200 | HTML 5, English |
5959 | ggnome.com | 6991 | 5.22 | 200 | HTML 5, English |
5960 | jmlr.org | 6992 | 5.22 | 200 | No Lang |
5961 | taipeitimes.com | 6993 | 5.22 | 200 | HTML 5, No Lang |
5962 | ionic.io | 6994 | 5.22 | 200 | HTML 5, English |
5963 | docs.splunk.com | 6995 | 5.22 | 200 | HTML 5, No Lang |
5964 | asiatimes.com | 6996 | 5.22 | 200 | HTML 5, English |
5965 | linkin.bio | 6999 | 5.22 | 200 | HTML 5, No Lang |
5966 | fr.news.yahoo.com | 7000 | 5.22 | 200 | HTML 5, No Lang |
5967 | auspost.com.au | 7001 | 5.22 | 200 | HTML 5, English |
5968 | waitbutwhy.com | 7002 | 5.22 | 200 | HTML 5, English |
5969 | ethanschoonover.com | 7003 | 5.22 | 200 | HTML 5, English |
5970 | humansecurity.com | 7004 | 5.22 | 200 | HTML 5, English |
5971 | software.intel.com | 7005 | 5.22 | 200 | HTML 5, English |
5972 | oxforddictionaries.com | 7006 | 5.22 | 200 | HTML 5, English |
5973 | emag.hu | 7007 | 5.22 | 200 | HTML 5 |
5974 | mkyong.com | 7008 | 5.22 | 200 | HTML 5, English |
5975 | zenhabits.net | 7009 | 5.22 | 200 | HTML 5, English |
5976 | tokyoartbeat.com | 7010 | 5.22 | 200 | HTML 5 |
5977 | wbcsd.org | 7011 | 5.22 | 200 | HTML 5, English |
5978 | bentley.edu | 7012 | 5.22 | 200 | HTML 5, English |
5979 | villagevoice.com | 7013 | 5.22 | 200 | HTML 5, English |
5980 | cygwin.com | 7014 | 5.22 | 200 | HTML 5, English |
5981 | ferrari.com | 7015 | 5.22 | 200 | HTML 5, English |
5982 | radios.ebc.com.br | 7016 | 5.22 | 200 | HTML 5 |
5983 | forum.effectivealtruism.org | 7017 | 5.22 | 200 | HTML 5, English |
5984 | paytm.com | 7019 | 5.22 | 200 | HTML 5, English |
5985 | spark.apache.org | 7020 | 5.22 | 200 | HTML 5, English |
5986 | mta.info | 7021 | 5.22 | 200 | English |
5987 | you.com | 7023 | 5.22 | 200 | HTML 5, English |
5988 | publichealth.lacounty.gov | 7024 | 5.22 | 200 | English |
5989 | ynet.co.il | 7025 | 5.22 | 200 | HTML 5 |
5990 | petmd.com | 7026 | 5.22 | 200 | HTML 5, English |
5991 | wm.com | 7029 | 5.22 | 200 | HTML 5, English |
5992 | roboform.com | 7030 | 5.22 | 200 | HTML 5, English |
5993 | qsrmagazine.com | 7031 | 5.22 | 200 | HTML 5, English |
5994 | freedesktop.org | 7034 | 5.22 | 200 | No Lang, Strict |
5995 | truthsocial.com | 7036 | 5.22 | 200 | HTML 5, English |
5996 | medicinenet.com | 7037 | 5.22 | 200 | HTML 5, English |
5997 | intercom.help | 7038 | 5.22 | 200 | HTML 5, English |
5998 | nrcs.usda.gov | 7040 | 5.22 | 200 | HTML 5, English |
5999 | pe.usps.com | 7041 | 5.22 | 200 | HTML 5, English |
6000 | joinhoney.com | 7042 | 5.22 | 200 | HTML 5, English |
Data from: Open PageRank