Table filled in from the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but not even in the top 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much inbound linking it takes to get on this list
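The doctype and lang observations above could be checked with something like the sketch below. It is not the crawler used for this table; `classify` and `_LangFinder` are hypothetical names, and the mapping to the Flags column ("HTML 5", "English", "No Lang") is an assumption inferred from the rows shown. The "Strict" flag is not handled, and only the short `<!DOCTYPE html>` form counts as HTML 5 here.

```python
import re
from html.parser import HTMLParser


class _LangFinder(HTMLParser):
    """Records the lang attribute of the first <html> tag, if any."""

    def __init__(self):
        super().__init__()
        self.lang = None
        self.seen_html = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


# Assumption: only the short HTML5 doctype at the very start of the page counts.
HTML5_DOCTYPE = re.compile(r"^\s*<!doctype\s+html\s*>", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return flags similar to the Flags column (assumed meaning)."""
    flags = []
    if HTML5_DOCTYPE.match(html):
        flags.append("HTML 5")

    finder = _LangFinder()
    finder.feed(html)
    lang = (finder.lang or "").strip().lower()
    if not lang:
        flags.append("No Lang")
    elif lang == "en" or lang.startswith("en-"):
        flags.append("English")
    # Any other language adds no flag, matching rows that show only "HTML 5".
    return flags
```

For example, `classify('<!doctype html><html lang="en-US"></html>')` yields both the "HTML 5" and "English" flags, while a page with an XHTML doctype and `lang="ja"` yields no flags at all.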
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
20201 | nyunews.com | 23584 | 4.84 | 200 | HTML 5, English |
20202 | cloud.googleblog.com | 23585 | 4.84 | 200 | HTML 5, English |
20203 | amp.scmp.com | 23586 | 4.84 | 200 | HTML 5, English |
20204 | w2.eff.org | 23587 | 4.84 | 200 | HTML 5, No Lang |
20205 | aristath.github.io | 23588 | 4.84 | 200 | HTML 5, English |
20206 | mediaron.com | 23589 | 4.84 | 200 | HTML 5, English |
20207 | everytimezone.com | 23591 | 4.84 | 200 | HTML 5, No Lang |
20208 | abdussamad.com | 23592 | 4.84 | 200 | HTML 5, English |
20209 | unmultimedia.org | 23593 | 4.84 | 200 | HTML 5, English |
20210 | developer.infusionsoft.com | 23594 | 4.84 | 200 | HTML 5, English |
20211 | the-digital-reader.com | 23596 | 4.84 | 200 | HTML 5, English |
20212 | mastersofscale.com | 23597 | 4.84 | 200 | HTML 5, English |
20213 | wordproof.com | 23599 | 4.84 | 200 | HTML 5, English |
20214 | gnunet.org | 23601 | 4.84 | 200 | HTML 5, English |
20215 | techno-science.net | 23602 | 4.84 | 200 | HTML 5 |
20216 | polk-county.net | 23604 | 4.84 | 200 | HTML 5, English |
20217 | symmetrymagazine.org | 23605 | 4.84 | 200 | HTML 5, English |
20218 | stooq.com | 23606 | 4.84 | 200 | No Lang |
20219 | citizen.jp | 23607 | 4.84 | 200 | HTML 5 |
20220 | hotelurbano.com | 23609 | 4.84 | 200 | HTML 5, English |
20221 | shang.qq.com | 23610 | 4.84 | 200 | HTML 5, No Lang |
20222 | kidsactivitiesblog.com | 23611 | 4.84 | 200 | HTML 5, English |
20223 | wondermark.com | 23612 | 4.84 | 200 | HTML 5, English |
20224 | money.msn.com | 23613 | 4.84 | 200 | HTML 5, English |
20225 | seruniversitario.com.br | 23614 | 4.84 | 200 | |
20226 | iis.fraunhofer.de | 23615 | 4.84 | 200 | HTML 5 |
20227 | stoneisland.com | 23617 | 4.84 | 200 | HTML 5, English |
20228 | familytreemaker.com | 23618 | 4.84 | 200 | No Lang |
20229 | qdl.qa | 23619 | 4.84 | 200 | HTML 5, English |
20230 | shacknews.com | 23620 | 4.84 | 200 | HTML 5, English |
20231 | webbtelescope.org | 23621 | 4.84 | 200 | HTML 5, English |
20232 | penntoday.upenn.edu | 23622 | 4.84 | 200 | HTML 5, English |
20233 | linux.slashdot.org | 23625 | 4.84 | 200 | English |
20234 | chrisfinke.com | 23626 | 4.84 | 200 | HTML 5, English |
20235 | kathyisawesome.com | 23627 | 4.84 | 200 | HTML 5, English |
20236 | everythingfonts.com | 23628 | 4.84 | 200 | HTML 5, English |
20237 | italia.it | 23629 | 4.84 | 200 | HTML 5, English |
20238 | geoffrey.crofte.fr | 23630 | 4.84 | 200 | HTML 5, No Lang |
20239 | thefragens.com | 23632 | 4.84 | 200 | HTML 5, English |
20240 | emergencemagazine.org | 23633 | 4.84 | 200 | HTML 5, English |
20241 | buayacorp.com | 23634 | 4.84 | 200 | HTML 5 |
20242 | library.cornell.edu | 23640 | 4.84 | 200 | HTML 5, English |
20243 | spitzer.caltech.edu | 23641 | 4.84 | 200 | HTML 5, No Lang |
20244 | secure.ssa.gov | 23642 | 4.84 | 200 | No Lang |
20245 | paymoapp.com | 23644 | 4.84 | 200 | HTML 5, English |
20246 | browserext.github.io | 23646 | 4.84 | 200 | HTML 5, English |
20247 | flybase.org | 23647 | 4.84 | 200 | HTML 5, English |
20248 | petinsurance.com | 23648 | 4.84 | 200 | HTML 5, English |
20249 | thestatesman.com | 23649 | 4.84 | 200 | HTML 5, English |
20250 | woebothealth.com | 23651 | 4.84 | 200 | HTML 5, English |
20251 | ip2location.io | 23652 | 4.84 | 200 | HTML 5, English |
20252 | felissimo.co.jp | 23653 | 4.84 | 200 | HTML 5 |
20253 | unite.ai | 23654 | 4.84 | 200 | HTML 5, English |
20254 | devarticles.com | 23655 | 4.84 | 200 | HTML 5, No Lang |
20255 | dailyforex.com | 23656 | 4.84 | 200 | HTML 5, English |
20256 | museot.fi | 23657 | 4.84 | 200 | |
20257 | itsupportguides.com | 23660 | 4.84 | 200 | HTML 5, English |
20258 | webhostingw.com | 23661 | 4.84 | 200 | HTML 5, English |
20259 | helen.blog | 23662 | 4.84 | 200 | HTML 5, English |
20260 | cision.com | 23663 | 4.84 | 200 | HTML 5, English |
20261 | legrand.com | 23664 | 4.84 | 200 | HTML 5, English |
20262 | 98fm.com | 23665 | 4.84 | 200 | HTML 5, English |
20263 | aura.com | 23666 | 4.84 | 200 | HTML 5, English |
20264 | opsi.gov.uk | 23667 | 4.84 | 200 | English |
20265 | tempsreel.nouvelobs.com | 23668 | 4.84 | 200 | HTML 5 |
20266 | sonymusic.com | 23669 | 4.84 | 200 | HTML 5, English |
20267 | simon.com | 23670 | 4.84 | 200 | HTML 5, English |
20268 | henryjenkins.org | 23671 | 4.84 | 200 | HTML 5, English |
20269 | mackaycartoons.net | 23672 | 4.84 | 200 | HTML 5, English |
20270 | zpe.gov.pl | 23673 | 4.84 | 200 | HTML 5 |
20271 | ettoday.net | 23674 | 4.84 | 200 | HTML 5 |
20272 | liferay.com | 23675 | 4.84 | 200 | HTML 5, English |
20273 | confluent.io | 23676 | 4.84 | 200 | HTML 5, English |
20274 | talk.ictvonline.org | 23677 | 4.84 | 200 | HTML 5, English |
20275 | around.com | 23678 | 4.84 | 200 | HTML 5, English |
20276 | cloze.com | 23679 | 4.84 | 200 | HTML 5, English |
20277 | t-mobile.pl | 23680 | 4.84 | 200 | HTML 5 |
20278 | sysdig.com | 23681 | 4.84 | 200 | HTML 5, English |
20279 | wpautolistings.com | 23682 | 4.84 | 200 | HTML 5, English |
20280 | string-db.org | 23683 | 4.84 | 200 | HTML 5, English |
20281 | keraweb.nl | 23685 | 4.84 | 200 | HTML 5 |
20282 | dan.com | 23686 | 4.84 | 200 | HTML 5, No Lang |
20283 | theartstory.org | 23687 | 4.84 | 200 | HTML 5, No Lang |
20284 | docs.ewww.io | 23688 | 4.84 | 200 | HTML 5, No Lang |
20285 | 411mania.com | 23689 | 4.84 | 200 | HTML 5, English |
20286 | www-1.ibm.com | 23690 | 4.84 | 200 | HTML 5, English |
20287 | lloc.de | 23691 | 4.84 | 200 | HTML 5 |
20288 | code-atlantic.com | 23692 | 4.84 | 200 | HTML 5, English |
20289 | tiptoppress.com | 23693 | 4.84 | 200 | HTML 5, English |
20290 | ib.berkeley.edu | 23695 | 4.84 | 200 | HTML 5, English |
20291 | ljmu.ac.uk | 23697 | 4.84 | 200 | HTML 5, English |
20292 | boxicons.com | 23698 | 4.84 | 200 | HTML 5, No Lang |
20293 | wingsforlifeworldrun.com | 23699 | 4.84 | 200 | HTML 5, English |
20294 | pronamic.eu | 23700 | 4.84 | 200 | HTML 5, English |
20295 | foodallergy.org | 23701 | 4.84 | 200 | HTML 5, English |
20296 | scte.org | 23702 | 4.84 | 200 | HTML 5, English |
20297 | voyager.jpl.nasa.gov | 23703 | 4.84 | 200 | HTML 5, English |
20298 | yakimaherald.com | 23704 | 4.84 | 200 | HTML 5, English |
20299 | book.interpark.com | 23705 | 4.84 | 200 | No Lang |
20300 | vice-emu.sourceforge.net | 23707 | 4.84 | 200 | English, Strict |
Data from: Open PageRank