Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16301 | dimensional.me | 19000 | 4.91 | 200 | HTML 5, English |
16302 | asd.gsfc.nasa.gov | 19001 | 4.91 | 200 | HTML 5, English |
16303 | story.californiasunday.com | 19002 | 4.91 | 200 | HTML 5, No Lang |
16304 | kiro7.com | 19003 | 4.91 | 200 | HTML 5, English |
16305 | csc.fi | 19004 | 4.91 | 200 | HTML 5 |
16306 | thebaffler.com | 19007 | 4.91 | 200 | HTML 5, English |
16307 | musicweek.com | 19008 | 4.91 | 200 | HTML 5, No Lang |
16308 | riflepaperco.com | 19009 | 4.91 | 200 | HTML 5, English |
16309 | spicejet.com | 19010 | 4.91 | 200 | HTML 5, No Lang |
16310 | parsely.com | 19011 | 4.91 | 200 | HTML 5, English |
16311 | studyusa.com | 19012 | 4.91 | 200 | HTML 5, English |
16312 | victronenergy.com | 19013 | 4.90 | 200 | HTML 5, No Lang |
16313 | untappedcities.com | 19014 | 4.90 | 200 | HTML 5, English |
16314 | ceres.org | 19015 | 4.90 | 200 | English |
16315 | sid.ir | 19016 | 4.90 | 200 | HTML 5 |
16316 | gettysburg.edu | 19017 | 4.90 | 200 | No Lang |
16317 | epson.jp | 19018 | 4.90 | 200 | HTML 5 |
16318 | segaretro.org | 19019 | 4.90 | 200 | HTML 5, English |
16319 | egged.co.il | 19020 | 4.90 | 200 | HTML 5 |
16320 | accupass.com | 19021 | 4.90 | 200 | HTML 5 |
16321 | us.sagepub.com | 19022 | 4.90 | 200 | HTML 5, English |
16322 | nanoleaf.me | 19023 | 4.90 | 200 | HTML 5, English |
16323 | homes.esat.kuleuven.be | 19024 | 4.90 | 200 | No Lang |
16324 | rejoiner.com | 19025 | 4.90 | 200 | HTML 5, English |
16325 | neilgaiman.com | 19026 | 4.90 | 200 | No Lang, Strict |
16326 | tcmb.gov.tr | 19027 | 4.90 | 200 | HTML 5 |
16327 | ooma.com | 19028 | 4.90 | 200 | English |
16328 | fredhutch.org | 19029 | 4.90 | 200 | HTML 5, English |
16329 | www-ssl.intel.com | 19032 | 4.90 | 200 | HTML 5, English |
16330 | valor.globo.com | 19033 | 4.90 | 200 | HTML 5 |
16331 | metfone.com.kh | 19036 | 4.90 | 200 | HTML 5, English |
16332 | nad.org | 19037 | 4.90 | 200 | HTML 5, English |
16333 | burst.shopify.com | 19038 | 4.90 | 200 | HTML 5, English |
16334 | tcl.com | 19039 | 4.90 | 200 | HTML 5, English |
16335 | get.app | 19042 | 4.90 | 200 | HTML 5, English |
16336 | issues.org | 19044 | 4.90 | 200 | HTML 5, English |
16337 | muppet.fandom.com | 19045 | 4.90 | 200 | HTML 5, English |
16338 | redislabs.com | 19046 | 4.90 | 200 | HTML 5, English |
16339 | fooducate.com | 19047 | 4.90 | 200 | HTML 5, English |
16340 | expensify.com | 19048 | 4.90 | 200 | HTML 5, No Lang |
16341 | scotiabank.com | 19049 | 4.90 | 200 | HTML 5, English |
16342 | projectworldimpact.com | 19050 | 4.90 | 200 | HTML 5, English |
16343 | exiftool.org | 19051 | 4.90 | 200 | No Lang, Transitional |
16344 | rhapsody.com | 19052 | 4.90 | 200 | HTML 5, English |
16345 | rheem.com | 19053 | 4.90 | 200 | HTML 5, English |
16346 | ntwind.com | 19054 | 4.90 | 200 | HTML 5, English |
16347 | solaresearch.org | 19055 | 4.90 | 200 | HTML 5, English |
16348 | en-au.wordpress.org | 19056 | 4.90 | 200 | HTML 5, English |
16349 | thegradient.pub | 19057 | 4.90 | 200 | HTML 5, English |
16350 | jbn.nl | 19058 | 4.90 | 200 | HTML 5 |
16351 | web.hypothes.is | 19059 | 4.90 | 200 | HTML 5, English |
16352 | photoroom.com | 19060 | 4.90 | 200 | HTML 5, English |
16353 | finland.fi | 19061 | 4.90 | 200 | HTML 5, English |
16354 | cuisine.journaldesfemmes.com | 19062 | 4.90 | 200 | |
16355 | vitalsource.com | 19064 | 4.90 | 200 | HTML 5, English |
16356 | ut.ee | 19065 | 4.90 | 200 | HTML 5 |
16357 | adstransparency.google.com | 19066 | 4.90 | 200 | HTML 5, English |
16358 | metallica.com | 19068 | 4.90 | 200 | HTML 5, English |
16359 | fairwork.gov.au | 19069 | 4.90 | 200 | HTML 5, English |
16360 | lists.linuxfoundation.org | 19071 | 4.90 | 200 | No Lang |
16361 | bioconductor.org | 19072 | 4.90 | 200 | HTML 5, English |
16362 | thefirearmblog.com | 19074 | 4.90 | 200 | HTML 5, English |
16363 | marthastewartweddings.com | 19075 | 4.90 | 200 | HTML 5, English |
16364 | datpiff.com | 19076 | 4.90 | 200 | HTML 5, No Lang |
16365 | gitlab.manjaro.org | 19077 | 4.90 | 200 | HTML 5, English |
16366 | imagine.art | 19078 | 4.90 | 200 | HTML 5, English |
16367 | data.mendeley.com | 19079 | 4.90 | 200 | HTML 5, English |
16368 | titantv.com | 19080 | 4.90 | 200 | English |
16369 | video.cnbc.com | 19081 | 4.90 | 200 | HTML 5, English |
16370 | bgdailynews.com | 19082 | 4.90 | 200 | HTML 5, English |
16371 | ip-api.com | 19083 | 4.90 | 200 | HTML 5, English |
16372 | signeasy.com | 19084 | 4.90 | 200 | HTML 5, No Lang |
16373 | gtr.ukri.org | 19085 | 4.90 | 200 | HTML 5, English |
16374 | shafaq.com | 19086 | 4.90 | 200 | HTML 5 |
16375 | pubads.g.doubleclick.net | 19087 | 4.90 | 200 | HTML 5, English |
16376 | natureasia.com | 19088 | 4.90 | 200 | HTML 5, English |
16377 | kyobobook.co.kr | 19089 | 4.90 | 200 | HTML 5 |
16378 | destroyallsoftware.com | 19090 | 4.90 | 200 | HTML 5, No Lang |
16379 | sfdora.org | 19091 | 4.90 | 200 | HTML 5, English |
16380 | thenational.scot | 19092 | 4.90 | 200 | HTML 5, English |
16381 | cobo.com | 19093 | 4.90 | 200 | HTML 5, English |
16382 | burlingtonfreepress.com | 19094 | 4.90 | 200 | HTML 5, English |
16383 | zabytek.pl | 19095 | 4.90 | 200 | HTML 5, English |
16384 | hamariweb.com | 19096 | 4.90 | 200 | No Lang, Transitional |
16385 | readingrockets.org | 19097 | 4.90 | 200 | HTML 5, English |
16386 | praxistipps.chip.de | 19098 | 4.90 | 200 | HTML 5 |
16387 | shef.ac.uk | 19099 | 4.90 | 200 | HTML 5, English |
16388 | consumer.org.hk | 19100 | 4.90 | 200 | HTML 5 |
16389 | gardenandgun.com | 19101 | 4.90 | 200 | HTML 5, English |
16390 | everbank.com | 19102 | 4.90 | 200 | HTML 5, English |
16391 | freepressunlimited.org | 19103 | 4.90 | 200 | HTML 5, English |
16392 | istpravda.com.ua | 19104 | 4.90 | 200 | HTML 5, No Lang |
16393 | open.lib.umn.edu | 19105 | 4.90 | 200 | HTML 5, English |
16394 | cafonline.org | 19106 | 4.90 | 200 | HTML 5, English |
16395 | maps.googleblog.com | 19107 | 4.90 | 200 | HTML 5, English |
16396 | storm.mg | 19108 | 4.90 | 200 | HTML 5 |
16397 | sciencemediacentre.org | 19109 | 4.90 | 200 | HTML 5, English |
16398 | delpher.nl | 19110 | 4.90 | 200 | HTML 5 |
16399 | chir.ag | 19111 | 4.90 | 200 | No Lang |
16400 | kafka.apache.org | 19112 | 4.90 | 200 | No Lang, Strict |
Data from: Open PageRank