Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16001 | wa.gov.au | 18645 | 4.91 | 200 | HTML 5, English |
16002 | gihyo.jp | 18646 | 4.91 | 200 | HTML 5 |
16003 | rtsoft.com | 18647 | 4.91 | 200 | English, Strict |
16004 | nordpass.com | 18649 | 4.91 | 200 | HTML 5, English |
16005 | janmarijnissen.nl | 18650 | 4.91 | 200 | No Lang, Transitional |
16006 | kansascityfed.org | 18651 | 4.91 | 200 | English |
16007 | ct24.ceskatelevize.cz | 18652 | 4.91 | 200 | HTML 5 |
16008 | motorsport-magazin.com | 18653 | 4.91 | 200 | HTML 5 |
16009 | erlang.org | 18654 | 4.91 | 200 | HTML 5, English |
16010 | blipfoto.com | 18655 | 4.91 | 200 | HTML 5, No Lang |
16011 | sistersofmercy.org | 18656 | 4.91 | 200 | HTML 5, English |
16012 | umc.org | 18657 | 4.91 | 200 | HTML 5, English |
16013 | anthem.com | 18658 | 4.91 | 200 | HTML 5, English |
16014 | tower.jp | 18659 | 4.91 | 200 | No Lang, Transitional |
16015 | fastmarkets.com | 18660 | 4.91 | 200 | HTML 5, English |
16016 | pandoc.org | 18661 | 4.91 | 200 | HTML 5, English |
16017 | text.com | 18662 | 4.91 | 200 | HTML 5, English |
16018 | scholar.google.it | 18663 | 4.91 | 200 | HTML 5, No Lang |
16019 | surveyheart.com | 18664 | 4.91 | 200 | HTML 5, No Lang |
16020 | library.fes.de | 18667 | 4.91 | 200 | HTML 5 |
16021 | qemu.org | 18669 | 4.91 | 200 | HTML 5, English |
16022 | vmi.lt | 18671 | 4.91 | 200 | HTML 5 |
16023 | insanelygoodrecipes.com | 18672 | 4.91 | 200 | HTML 5, English |
16024 | drf.com | 18673 | 4.91 | 200 | HTML 5, English |
16025 | en.wikivoyage.org | 18674 | 4.91 | 200 | HTML 5, No Lang |
16026 | readyfor.jp | 18675 | 4.91 | 200 | HTML 5 |
16027 | eastwoodguitars.com | 18676 | 4.91 | 200 | HTML 5, English |
16028 | astrazeneca.com | 18678 | 4.91 | 200 | HTML 5, English |
16029 | lta.gov.sg | 18680 | 4.91 | 200 | HTML 5, No Lang |
16030 | developer.cisco.com | 18681 | 4.91 | 200 | HTML 5, English |
16031 | desktop.github.com | 18683 | 4.91 | 200 | HTML 5, English |
16032 | radiohamburg.de | 18684 | 4.91 | 200 | HTML 5 |
16033 | xanthir.com | 18685 | 4.91 | 200 | HTML 5, No Lang |
16034 | publishpress.com | 18686 | 4.91 | 200 | HTML 5, English |
16035 | lens.blogs.nytimes.com | 18689 | 4.91 | 200 | HTML 5, English |
16036 | mhpbooks.com | 18690 | 4.91 | 200 | No Lang |
16037 | start.stockholm | 18691 | 4.91 | 200 | HTML 5 |
16038 | theface.com | 18692 | 4.91 | 200 | HTML 5, English |
16039 | mobilemonkey.com | 18693 | 4.91 | 200 | HTML 5, English |
16040 | virusbulletin.com | 18694 | 4.91 | 200 | HTML 5, English |
16041 | webpt.com | 18695 | 4.91 | 200 | HTML 5, English |
16042 | metropoleruhr.de | 18696 | 4.91 | 200 | HTML 5 |
16043 | npca.org | 18697 | 4.91 | 200 | HTML 5, English |
16044 | cefic.org | 18698 | 4.91 | 200 | HTML 5, No Lang |
16045 | automobiles.honda.com | 18699 | 4.91 | 200 | HTML 5, English |
16046 | event.webinarjam.com | 18700 | 4.91 | 200 | HTML 5, English |
16047 | dailytimes.com.pk | 18701 | 4.91 | 200 | HTML 5, English |
16048 | bitsavers.org | 18704 | 4.91 | 200 | No Lang |
16049 | mmafighting.com | 18705 | 4.91 | 200 | HTML 5, English |
16050 | critrole.com | 18708 | 4.91 | 200 | HTML 5, English |
16051 | sparkasse.de | 18709 | 4.91 | 200 | HTML 5 |
16052 | blog.arduino.cc | 18710 | 4.91 | 200 | HTML 5, English |
16053 | nae.edu | 18715 | 4.91 | 200 | HTML 5, English |
16054 | colorlines.com | 18716 | 4.91 | 200 | HTML 5, English |
16055 | waff.com | 18717 | 4.91 | 200 | HTML 5, English |
16056 | home.webinarjam.com | 18718 | 4.91 | 200 | No Lang |
16057 | whitecube.com | 18719 | 4.91 | 200 | HTML 5, English |
16058 | qustodio.com | 18720 | 4.91 | 200 | HTML 5, English |
16059 | web.cs.dal.ca | 18721 | 4.91 | 200 | HTML 5, English |
16060 | data.public.lu | 18722 | 4.91 | 200 | HTML 5 |
16061 | git.gnome.org | 18723 | 4.91 | 200 | HTML 5, No Lang |
16062 | tryinteract.com | 18725 | 4.91 | 200 | HTML 5, No Lang |
16063 | the-numbers.com | 18726 | 4.91 | 200 | HTML 5, No Lang |
16064 | android.wordpress.org | 18727 | 4.91 | 200 | HTML 5, English |
16065 | metservice.com | 18728 | 4.91 | 200 | HTML 5, English |
16066 | portals.iucn.org | 18730 | 4.91 | 200 | No Lang, Strict |
16067 | molit.go.kr | 18731 | 4.91 | 200 | Transitional |
16068 | pling.com | 18733 | 4.91 | 200 | HTML 5, English |
16069 | xmlgraphics.apache.org | 18735 | 4.91 | 200 | HTML 5, English |
16070 | jiji.com | 18736 | 4.91 | 200 | HTML 5 |
16071 | outreachy.org | 18737 | 4.91 | 200 | HTML 5, No Lang |
16072 | designwall.com | 18738 | 4.91 | 200 | HTML 5, English |
16073 | themebeez.com | 18739 | 4.91 | 200 | HTML 5, English |
16074 | waterfootprint.org | 18740 | 4.91 | 200 | HTML 5, English |
16075 | nbs.rs | 18741 | 4.91 | 200 | HTML 5, No Lang |
16076 | bbs.archlinux.org | 18742 | 4.91 | 200 | English, Strict |
16077 | missiveapp.com | 18743 | 4.91 | 200 | HTML 5, English |
16078 | truendo.com | 18745 | 4.91 | 200 | HTML 5, English |
16079 | palscity.com | 18746 | 4.91 | 200 | HTML 5, English |
16080 | fancyapps.com | 18747 | 4.91 | 200 | HTML 5, English |
16081 | gold.org | 18748 | 4.91 | 200 | HTML 5, English |
16082 | ozodlik.org | 18749 | 4.91 | 200 | HTML 5 |
16083 | spain.info | 18750 | 4.91 | 200 | HTML 5, English |
16084 | nakamichi-usa.com | 18751 | 4.91 | 200 | HTML 5, English |
16085 | dakine.com | 18752 | 4.91 | 200 | HTML 5, English |
16086 | presis.nl | 18753 | 4.91 | 200 | HTML 5 |
16087 | secure.givelively.org | 18754 | 4.91 | 200 | HTML 5, English |
16088 | codepublishing.com | 18755 | 4.91 | 200 | HTML 5, English |
16089 | wxwidgets.org | 18756 | 4.91 | 200 | HTML 5, No Lang |
16090 | harvesthq.github.io | 18757 | 4.91 | 200 | HTML 5, No Lang |
16091 | photopea.com | 18758 | 4.91 | 200 | English |
16092 | forums.digitalpoint.com | 18759 | 4.91 | 200 | HTML 5, English |
16093 | evolvingtable.com | 18760 | 4.91 | 200 | HTML 5, English |
16094 | majestic.com | 18761 | 4.91 | 200 | HTML 5, English |
16095 | magazinec.com | 18762 | 4.91 | 200 | HTML 5, English |
16096 | fbref.com | 18764 | 4.91 | 200 | HTML 5, English |
16097 | scotlandscensus.gov.uk | 18767 | 4.91 | 200 | HTML 5, English |
16098 | td.org | 18768 | 4.91 | 200 | HTML 5, English |
16099 | oeamtc.at | 18769 | 4.91 | 200 | HTML 5 |
16100 | en.freetobook.com | 18770 | 4.91 | 200 | HTML 5, No Lang |
Data from: Open PageRank