Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19301 | thephiladelphiacitizen.org | 22511 | 4.86 | 200 | HTML 5, English |
19302 | evilmartians.com | 22512 | 4.86 | 200 | HTML 5, English |
19303 | atncorp.com | 22513 | 4.86 | 200 | HTML 5, No Lang |
19304 | houstonmethodist.org | 22514 | 4.86 | 200 | HTML 5, English |
19305 | auntieannes.com | 22515 | 4.86 | 200 | HTML 5, English |
19306 | quickemailverification.com | 22516 | 4.86 | 200 | HTML 5, English |
19307 | activemq.apache.org | 22517 | 4.86 | 200 | HTML 5, English |
19308 | oscar.go.com | 22518 | 4.86 | 200 | HTML 5, English |
19309 | open.library.ubc.ca | 22519 | 4.86 | 200 | HTML 5, English |
19310 | demo.sngine.com | 22521 | 4.86 | 200 | HTML 5, English |
19311 | alukah.net | 22522 | 4.86 | 200 | HTML 5 |
19312 | knowablemagazine.org | 22523 | 4.86 | 200 | HTML 5, No Lang |
19313 | fold.it | 22524 | 4.86 | 200 | HTML 5, English |
19314 | skinflint.co.uk | 22525 | 4.86 | 200 | HTML 5, English |
19315 | press.siemens.com | 22526 | 4.86 | 200 | HTML 5, English |
19316 | coachella.com | 22527 | 4.86 | 200 | HTML 5, English |
19317 | insights.ovid.com | 22528 | 4.86 | 200 | No Lang |
19318 | raymondcamden.com | 22529 | 4.86 | 200 | HTML 5, English |
19319 | pioneers.io | 22531 | 4.86 | 200 | HTML 5, English |
19320 | saferinternet.org.uk | 22532 | 4.86 | 200 | HTML 5, English |
19321 | uniteforliteracy.com | 22533 | 4.86 | 200 | HTML 5, English |
19322 | bc.ctvnews.ca | 22536 | 4.86 | 200 | HTML 5, English |
19323 | kvb.koeln | 22537 | 4.86 | 200 | HTML 5 |
19324 | bsr.org | 22538 | 4.86 | 200 | HTML 5, English |
19325 | metrics.torproject.org | 22539 | 4.86 | 200 | HTML 5, English |
19326 | framasoft.org | 22540 | 4.86 | 200 | HTML 5 |
19327 | fing.com | 22541 | 4.86 | 200 | HTML 5, English |
19328 | locusmag.com | 22542 | 4.86 | 200 | HTML 5, English |
19329 | consortiumnews.com | 22543 | 4.86 | 200 | HTML 5, English |
19330 | chromeos.dev | 22544 | 4.86 | 200 | HTML 5, No Lang |
19331 | uu.se | 22547 | 4.86 | 200 | HTML 5 |
19332 | parlamento.it | 22548 | 4.86 | 200 | Strict |
19333 | sustainability.aboutamazon.com | 22549 | 4.86 | 200 | HTML 5, English |
19334 | previewsworld.com | 22550 | 4.86 | 200 | HTML 5, English |
19335 | crowdfireapp.com | 22551 | 4.86 | 200 | HTML 5, No Lang |
19336 | digimorph.org | 22552 | 4.86 | 200 | No Lang |
19337 | childrenshealthdefense.org | 22553 | 4.86 | 200 | HTML 5, English |
19338 | offcourse.co | 22554 | 4.86 | 200 | HTML 5, English |
19339 | bitrebels.com | 22555 | 4.86 | 200 | HTML 5, English |
19340 | siliconera.com | 22556 | 4.86 | 200 | HTML 5, English |
19341 | norway.no | 22557 | 4.86 | 200 | HTML 5, English |
19342 | congstar.de | 22558 | 4.86 | 200 | HTML 5 |
19343 | ccm.net | 22559 | 4.86 | 200 | English |
19344 | dogtime.com | 22560 | 4.86 | 200 | HTML 5, English |
19345 | maplight.org | 22562 | 4.86 | 200 | HTML 5, English |
19346 | squirrel-news.net | 22563 | 4.86 | 200 | HTML 5, English |
19347 | phonak.com | 22564 | 4.86 | 200 | HTML 5, English |
19348 | ccf.org.cn | 22565 | 4.86 | 200 | HTML 5 |
19349 | lesinrocks.com | 22566 | 4.86 | 200 | HTML 5 |
19350 | 3sat.de | 22567 | 4.86 | 200 | HTML 5 |
19351 | korean.visitkorea.or.kr | 22568 | 4.86 | 200 | HTML 5 |
19352 | exoplanetarchive.ipac.caltech.edu | 22569 | 4.86 | 200 | HTML 5, No Lang |
19353 | awm.gov.au | 22570 | 4.86 | 200 | HTML 5, English |
19354 | ncpgambling.org | 22571 | 4.86 | 200 | HTML 5, English |
19355 | clubofrome.org | 22572 | 4.86 | 200 | HTML 5, English |
19356 | ccsenet.org | 22573 | 4.86 | 200 | HTML 5, English |
19357 | aqara.com | 22575 | 4.86 | 200 | HTML 5, English |
19358 | poynton.com | 22576 | 4.86 | 200 | HTML 5, English |
19359 | uefi.org | 22577 | 4.86 | 200 | HTML 5, English |
19360 | wahl-o-mat.de | 22578 | 4.86 | 200 | HTML 5 |
19361 | juser.fz-juelich.de | 22579 | 4.86 | 200 | English, Transitional |
19362 | segmentfault.com | 22580 | 4.86 | 200 | HTML 5 |
19363 | worksinprogress.co | 22582 | 4.86 | 200 | HTML 5, English |
19364 | education.govt.nz | 22583 | 4.86 | 200 | HTML 5, English |
19365 | dos.ny.gov | 22585 | 4.86 | 200 | HTML 5, English |
19366 | wisn.com | 22588 | 4.86 | 200 | HTML 5, English |
19367 | android.stackexchange.com | 22589 | 4.86 | 200 | HTML 5, English |
19368 | euro-fusion.org | 22590 | 4.86 | 200 | HTML 5, English |
19369 | luxafor.com | 22591 | 4.86 | 200 | HTML 5, English |
19370 | extension.umaine.edu | 22592 | 4.86 | 200 | HTML 5, English |
19371 | cair.com | 22593 | 4.86 | 200 | HTML 5, English |
19372 | fox4news.com | 22595 | 4.86 | 200 | HTML 5, English |
19373 | ucc.org | 22596 | 4.86 | 200 | HTML 5, English |
19374 | ankidroid.org | 22597 | 4.86 | 200 | HTML 5, English |
19375 | kenya-airways.com | 22598 | 4.86 | 200 | HTML 5, English |
19376 | nghttp2.org | 22599 | 4.86 | 200 | HTML 5, No Lang |
19377 | fcc-fac.ca | 22600 | 4.86 | 200 | HTML 5, English |
19378 | phunware.com | 22602 | 4.86 | 200 | HTML 5, English |
19379 | articles.bplans.com | 22603 | 4.86 | 200 | HTML 5, English |
19380 | ascii.textfiles.com | 22604 | 4.86 | 200 | HTML 5, English |
19381 | thinkwithportals.com | 22605 | 4.86 | 200 | HTML 5, No Lang |
19382 | puri.sm | 22606 | 4.86 | 200 | HTML 5, English |
19383 | mydoterra.com | 22607 | 4.86 | 200 | No Lang |
19384 | seesaawiki.jp | 22608 | 4.86 | 200 | HTML 5 |
19385 | micro.magnet.fsu.edu | 22609 | 4.86 | 200 | No Lang |
19386 | generations.fr | 22610 | 4.86 | 200 | HTML 5 |
19387 | analytics.googleblog.com | 22611 | 4.86 | 200 | HTML 5, English |
19388 | australiangeographic.com.au | 22612 | 4.86 | 200 | HTML 5, English |
19389 | cin.ufpe.br | 22614 | 4.86 | 200 | HTML 5 |
19390 | fargomoorhead.org | 22616 | 4.86 | 200 | HTML 5, English |
19391 | dailyprincetonian.com | 22617 | 4.86 | 200 | HTML 5, English |
19392 | laurenconrad.com | 22618 | 4.86 | 200 | HTML 5, English |
19393 | wikis.world | 22619 | 4.86 | 200 | HTML 5, English |
19394 | axonaut.com | 22621 | 4.86 | 200 | HTML 5 |
19395 | publicpolicypolling.com | 22622 | 4.86 | 200 | HTML 5, No Lang |
19396 | clario.co | 22623 | 4.86 | 200 | HTML 5, English |
19397 | opengraphprotocol.org | 22624 | 4.86 | 200 | HTML 5, No Lang |
19398 | bokus.com | 22625 | 4.86 | 200 | HTML 5 |
19399 | travellerspoint.com | 22626 | 4.86 | 200 | HTML 5, English |
19400 | surgeongeneral.gov | 22628 | 4.86 | 200 | HTML 5, English |
Data from: Open PageRank