Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
17501 | trinet.com | 20408 | 4.88 | 200 | HTML 5, English |
17502 | gahetna.nl | 20409 | 4.88 | 200 | HTML 5 |
17503 | dhiratara.com | 20410 | 4.88 | 200 | HTML 5 |
17504 | historichotels.org | 20411 | 4.88 | 200 | HTML 5, English |
17505 | pocketbitcoin.com | 20414 | 4.88 | 200 | HTML 5, English |
17506 | kith.com | 20415 | 4.88 | 200 | HTML 5, English |
17507 | finehomebuilding.com | 20416 | 4.88 | 200 | HTML 5, English |
17508 | pandorafms.com | 20418 | 4.88 | 200 | HTML 5, English |
17509 | law.uchicago.edu | 20419 | 4.88 | 200 | HTML 5, English |
17510 | savetheelephants.org | 20421 | 4.88 | 200 | HTML 5, English |
17511 | lexus.com | 20422 | 4.88 | 200 | No Lang |
17512 | thetab.com | 20424 | 4.88 | 200 | HTML 5, English |
17513 | tallink.com | 20426 | 4.88 | 200 | HTML 5, English |
17514 | ultius.com | 20427 | 4.88 | 200 | HTML 5, English |
17515 | cred.club | 20428 | 4.88 | 200 | HTML 5, No Lang |
17516 | nshealth.ca | 20429 | 4.88 | 200 | HTML 5, English |
17517 | monticello.org | 20430 | 4.88 | 200 | HTML 5, English |
17518 | maynoothuniversity.ie | 20432 | 4.88 | 200 | English |
17519 | certbot.eff.org | 20433 | 4.88 | 200 | HTML 5, No Lang |
17520 | unos.org | 20434 | 4.88 | 200 | HTML 5, English |
17521 | larepublica.pe | 20436 | 4.88 | 200 | HTML 5 |
17522 | shimo.im | 20438 | 4.88 | 200 | HTML 5, English |
17523 | dmm.co.jp | 20439 | 4.88 | 200 | HTML 5 |
17524 | electricliterature.com | 20440 | 4.88 | 200 | HTML 5, English |
17525 | buchmesse.de | 20441 | 4.88 | 200 | HTML 5 |
17526 | support.f5.com | 20442 | 4.88 | 200 | HTML 5, English |
17527 | soleretriever.com | 20443 | 4.88 | 200 | HTML 5, English |
17528 | phmc.pa.gov | 20444 | 4.88 | 200 | HTML 5, English |
17529 | thedivinemercy.org | 20445 | 4.88 | 200 | English |
17530 | transitionnetwork.org | 20446 | 4.88 | 200 | HTML 5, English |
17531 | aofoundation.org | 20447 | 4.88 | 200 | HTML 5, No Lang |
17532 | marketplace.magento.com | 20449 | 4.88 | 200 | HTML 5, English |
17533 | eesc.europa.eu | 20450 | 4.88 | 200 | HTML 5, English |
17534 | libdems.org.uk | 20452 | 4.88 | 200 | HTML 5, English |
17535 | registerguard.com | 20453 | 4.88 | 200 | HTML 5, English |
17536 | gothiacup.se | 20454 | 4.88 | 200 | HTML 5, English |
17537 | jhr.uwpress.org | 20455 | 4.88 | 200 | HTML 5, English |
17538 | techon.nikkeibp.co.jp | 20458 | 4.88 | 200 | HTML 5 |
17539 | booktopia.com.au | 20459 | 4.88 | 200 | HTML 5, English |
17540 | laughteryoga.org | 20460 | 4.88 | 200 | HTML 5, English |
17541 | news.ubc.ca | 20461 | 4.88 | 200 | HTML 5, English |
17542 | rcgroups.com | 20462 | 4.88 | 200 | English, Transitional |
17543 | math.sci.hiroshima-u.ac.jp | 20463 | 4.88 | 200 | No Lang, Transitional |
17544 | gekirock.com | 20464 | 4.88 | 200 | HTML 5, No Lang |
17545 | vm.ee | 20466 | 4.88 | 200 | HTML 5 |
17546 | rethinkrobotics.com | 20467 | 4.88 | 200 | HTML 5, English |
17547 | home.bt.com | 20470 | 4.88 | 200 | HTML 5, English |
17548 | jquery.malsup.com | 20473 | 4.88 | 200 | HTML 5, English |
17549 | onjava.com | 20474 | 4.88 | 200 | HTML 5, English |
17550 | americanalpineclub.org | 20475 | 4.88 | 200 | HTML 5, English |
17551 | snapon.com | 20476 | 4.88 | 200 | HTML 5, No Lang |
17552 | sjc.sp.gov.br | 20477 | 4.88 | 200 | HTML 5 |
17553 | wccftech.com | 20478 | 4.88 | 200 | HTML 5, English |
17554 | blueorigin.com | 20480 | 4.88 | 200 | HTML 5, English |
17555 | caseih.com | 20481 | 4.88 | 200 | No Lang |
17556 | rp.pl | 20482 | 4.88 | 200 | HTML 5 |
17557 | wplook.com | 20483 | 4.88 | 200 | HTML 5, English |
17558 | appdefensealliance.dev | 20484 | 4.88 | 200 | HTML 5, English |
17559 | seabreeze.com.au | 20485 | 4.88 | 200 | HTML 5, English |
17560 | theses.gla.ac.uk | 20486 | 4.88 | 200 | HTML 5, English |
17561 | mixergy.com | 20487 | 4.88 | 200 | HTML 5, English |
17562 | divany.hu | 20488 | 4.88 | 200 | HTML 5 |
17563 | meny.no | 20489 | 4.88 | 200 | HTML 5 |
17564 | timelyapp.com | 20490 | 4.88 | 200 | HTML 5, English |
17565 | buildyourfuture.withgoogle.com | 20491 | 4.88 | 200 | HTML 5, English |
17566 | belfercenter.org | 20492 | 4.88 | 200 | HTML 5, English |
17567 | sfexaminer.com | 20493 | 4.88 | 200 | HTML 5, English |
17568 | gazetaprawna.pl | 20494 | 4.88 | 200 | HTML 5 |
17569 | news.slashdot.org | 20495 | 4.88 | 200 | English |
17570 | myabandonware.com | 20496 | 4.88 | 200 | HTML 5, English |
17571 | mother.ly | 20497 | 4.88 | 200 | HTML 5, English |
17572 | biglots.com | 20498 | 4.88 | 200 | HTML 5, English |
17573 | en.wikifur.com | 20499 | 4.88 | 200 | HTML 5, English |
17574 | designkit.org | 20500 | 4.88 | 200 | HTML 5, No Lang |
17575 | samaritanspurse.org | 20501 | 4.88 | 200 | English, Transitional |
17576 | account.mapbox.com | 20502 | 4.88 | 200 | HTML 5, English |
17577 | en.mehrnews.com | 20503 | 4.88 | 200 | HTML 5, English |
17578 | thebookerprizes.com | 20504 | 4.88 | 200 | HTML 5, English |
17579 | catholic.org | 20505 | 4.88 | 200 | HTML 5, English |
17580 | user.it.uu.se | 20506 | 4.88 | 200 | No Lang |
17581 | medical.nema.org | 20507 | 4.88 | 200 | HTML 5, English |
17582 | marklogic.com | 20508 | 4.88 | 200 | HTML 5, English |
17583 | iprima.cz | 20509 | 4.88 | 200 | HTML 5 |
17584 | gallaudet.edu | 20510 | 4.88 | 200 | HTML 5, English |
17585 | portal.opengeospatial.org | 20511 | 4.88 | 200 | No Lang |
17586 | secure.helpscout.net | 20512 | 4.88 | 200 | HTML 5, English |
17587 | factmag.com | 20517 | 4.88 | 200 | HTML 5, English |
17588 | nscorp.com | 20518 | 4.88 | 200 | HTML 5, English |
17589 | autoblog.nl | 20519 | 4.88 | 200 | HTML 5 |
17590 | pencil2d.org | 20520 | 4.88 | 200 | HTML 5, English |
17591 | regonline.com | 20522 | 4.88 | 200 | HTML 5, English |
17592 | ej.uz | 20523 | 4.88 | 200 | HTML 5 |
17593 | falstaff.at | 20524 | 4.88 | 200 | HTML 5 |
17594 | websitemagazine.com | 20525 | 4.88 | 200 | HTML 5, English |
17595 | news.opensuse.org | 20526 | 4.88 | 200 | HTML 5, English |
17596 | indieauth.com | 20527 | 4.88 | 200 | HTML 5, English |
17597 | sjofartsverket.se | 20528 | 4.88 | 200 | HTML 5 |
17598 | audacious-media-player.org | 20529 | 4.88 | 200 | HTML 5, English |
17599 | boomplay.com | 20530 | 4.88 | 200 | HTML 5, English |
17600 | tfr.faa.gov | 20531 | 4.88 | 200 | HTML 5, English |
Data from: Open PageRank