Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19001 | scalemates.com | 22150 | 4.86 | 200 | HTML 5, English |
19002 | frontofficesports.com | 22151 | 4.86 | 200 | HTML 5, English |
19003 | iarc.fr | 22153 | 4.86 | 200 | HTML 5, English |
19004 | asianews.it | 22155 | 4.86 | 200 | HTML 5 |
19005 | mpt.org | 22156 | 4.86 | 200 | HTML 5, English |
19006 | merseytravel.gov.uk | 22157 | 4.86 | 200 | HTML 5, English |
19007 | lubimyczytac.pl | 22158 | 4.86 | 200 | HTML 5 |
19008 | youcaring.com | 22159 | 4.86 | 200 | HTML 5, English |
19009 | linuxfromscratch.org | 22160 | 4.86 | 200 | English |
19010 | tmd.go.th | 22162 | 4.86 | 200 | HTML 5 |
19011 | topquadrant.com | 22164 | 4.86 | 200 | HTML 5, English |
19012 | docs.gradle.org | 22165 | 4.86 | 200 | HTML 5, No Lang |
19013 | bfarm.de | 22166 | 4.86 | 200 | HTML 5 |
19014 | wpfavs.com | 22167 | 4.86 | 200 | HTML 5, No Lang |
19015 | defence.gov.au | 22168 | 4.86 | 200 | HTML 5, English |
19016 | formacar.com | 22169 | 4.86 | 200 | HTML 5, English |
19017 | oi.uchicago.edu | 22172 | 4.86 | 200 | HTML 5, English |
19018 | amawaterways.com | 22173 | 4.86 | 200 | HTML 5, English |
19019 | documentation.bloomreach.com | 22175 | 4.86 | 200 | HTML 5, No Lang |
19020 | africacdc.org | 22176 | 4.86 | 200 | HTML 5, English |
19021 | ecommercebytes.com | 22177 | 4.86 | 200 | HTML 5, English |
19022 | handshake.org | 22178 | 4.86 | 200 | HTML 5, English |
19023 | mindspring.com | 22179 | 4.86 | 200 | HTML 5, English |
19024 | mobiledetect.net | 22180 | 4.86 | 200 | HTML 5, English |
19025 | digitalassets.lib.berkeley.edu | 22181 | 4.86 | 200 | No Lang |
19026 | learnprompting.org | 22182 | 4.86 | 200 | HTML 5, English |
19027 | images.google.de | 22185 | 4.86 | 200 | HTML 5, English |
19028 | proprivacy.com | 22186 | 4.86 | 200 | HTML 5, English |
19029 | brandonsanderson.com | 22187 | 4.86 | 200 | HTML 5, English |
19030 | bacp.co.uk | 22188 | 4.86 | 200 | HTML 5, English |
19031 | abtesting.ai | 22189 | 4.86 | 200 | HTML 5, English |
19032 | klikki.fi | 22190 | 4.86 | 200 | HTML 5, English |
19033 | capitalgazette.com | 22191 | 4.86 | 200 | HTML 5, English |
19034 | tpl.org | 22193 | 4.86 | 200 | HTML 5, English |
19035 | msevents.microsoft.com | 22194 | 4.86 | 200 | HTML 5, English |
19036 | s3-us-east-2.amazonaws.com | 22195 | 4.86 | 200 | HTML 5, English |
19037 | a24films.com | 22196 | 4.86 | 200 | HTML 5, English |
19038 | gbes.com | 22197 | 4.86 | 200 | HTML 5, No Lang |
19039 | red3d.com | 22198 | 4.86 | 200 | No Lang, Transitional |
19040 | nonprofitquarterly.org | 22199 | 4.86 | 200 | HTML 5, English |
19041 | trustedhousesitters.com | 22200 | 4.86 | 200 | HTML 5, English |
19042 | handlebarsjs.com | 22201 | 4.86 | 200 | HTML 5, English |
19043 | ign.fr | 22202 | 4.86 | 200 | HTML 5 |
19044 | flythemes.net | 22203 | 4.86 | 200 | HTML 5, English |
19045 | vattenfall.de | 22204 | 4.86 | 200 | HTML 5 |
19046 | publicwhip.org.uk | 22205 | 4.86 | 200 | HTML 5, No Lang |
19047 | lora-alliance.org | 22206 | 4.86 | 200 | English |
19048 | sv.uio.no | 22207 | 4.86 | 200 | HTML 5 |
19049 | spaceweather.com | 22208 | 4.86 | 200 | No Lang, Transitional |
19050 | joaoleitao.com | 22210 | 4.86 | 200 | HTML 5, English |
19051 | bindingdb.org | 22211 | 4.86 | 200 | HTML 5, English |
19052 | sunsama.com | 22212 | 4.86 | 200 | HTML 5, English |
19053 | popcash.net | 22213 | 4.86 | 200 | HTML 5, English |
19054 | fakespot.com | 22214 | 4.86 | 200 | HTML 5, English |
19055 | mkaz.blog | 22215 | 4.86 | 200 | HTML 5, English |
19056 | carmensluxurytravel.com | 22218 | 4.86 | 200 | HTML 5, English |
19057 | mlflow.org | 22219 | 4.86 | 200 | HTML 5, English |
19058 | freshports.org | 22221 | 4.86 | 200 | HTML 5, English |
19059 | cawp.rutgers.edu | 22222 | 4.86 | 200 | HTML 5, English |
19060 | jvcmusic.co.jp | 22223 | 4.86 | 200 | HTML 5 |
19061 | clearbit.com | 22224 | 4.86 | 200 | HTML 5, English |
19062 | uni-koeln.de | 22225 | 4.86 | 200 | HTML 5 |
19063 | swimming.org | 22226 | 4.86 | 200 | HTML 5, English |
19064 | nationalexpress.com | 22227 | 4.86 | 200 | HTML 5, English |
19065 | cerncourier.com | 22228 | 4.86 | 200 | HTML 5, English |
19066 | breakingnews.ie | 22229 | 4.86 | 200 | HTML 5, English |
19067 | books.nap.edu | 22230 | 4.86 | 200 | HTML 5, English |
19068 | chelseafc.com | 22231 | 4.86 | 200 | HTML 5, English |
19069 | prodege.com | 22232 | 4.86 | 200 | HTML 5, English |
19070 | ucf.edu | 22233 | 4.86 | 200 | HTML 5, English |
19071 | wikinews.org | 22234 | 4.86 | 200 | HTML 5, English |
19072 | people.mpi-inf.mpg.de | 22235 | 4.86 | 200 | HTML 5, English |
19073 | museum.wa.gov.au | 22236 | 4.86 | 200 | HTML 5, English |
19074 | spinoff.nasa.gov | 22237 | 4.86 | 200 | HTML 5, English |
19075 | goodmenproject.com | 22238 | 4.86 | 200 | HTML 5, English |
19076 | lewishowes.com | 22239 | 4.86 | 200 | HTML 5, English |
19077 | blogs.kde.org | 22240 | 4.86 | 200 | HTML 5, English |
19078 | tortoisesvn.net | 22241 | 4.86 | 200 | HTML 5, English |
19079 | cis.org | 22242 | 4.86 | 200 | HTML 5, English |
19080 | nimbusweb.me | 22243 | 4.86 | 200 | HTML 5, English |
19081 | culqi.com | 22244 | 4.86 | 200 | HTML 5 |
19082 | joinhandshake.com | 22245 | 4.86 | 200 | HTML 5, English |
19083 | guernicamag.com | 22246 | 4.86 | 200 | HTML 5, English |
19084 | coxandcox.co.uk | 22247 | 4.86 | 200 | HTML 5, English |
19085 | willamette.edu | 22248 | 4.86 | 200 | HTML 5, English |
19086 | dserver.bundestag.de | 22249 | 4.86 | 200 | No Lang |
19087 | tbo.com | 22250 | 4.86 | 200 | HTML 5, English |
19088 | frogi.co.il | 22251 | 4.86 | 200 | HTML 5 |
19089 | legalseafoods.com | 22252 | 4.86 | 200 | HTML 5, English |
19090 | k3s.io | 22253 | 4.86 | 200 | HTML 5, English |
19091 | kvk.nl | 22254 | 4.86 | 200 | HTML 5 |
19092 | fishbase.org | 22255 | 4.86 | 200 | No Lang, Strict |
19093 | mcdonalds.co.jp | 22256 | 4.86 | 200 | HTML 5 |
19094 | myersbriggs.org | 22257 | 4.86 | 200 | HTML 5, English |
19095 | premiumwebsites.net | 22258 | 4.86 | 200 | HTML 5, English |
19096 | tvo.org | 22259 | 4.86 | 200 | HTML 5, English |
19097 | xm1math.net | 22260 | 4.86 | 200 | HTML 5, English |
19098 | groruddalen.no | 22261 | 4.86 | 200 | HTML 5 |
19099 | waz.de | 22264 | 4.86 | 200 | HTML 5 |
19100 | listen.hatnote.com | 22265 | 4.86 | 200 | No Lang |
Data from: Open PageRank