Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19401 | w2.vatican.va | 22629 | 4.86 | 200 | HTML 5, No Lang |
19402 | deskera.com | 22630 | 4.86 | 200 | HTML 5, English |
19403 | gestion.pe | 22631 | 4.86 | 200 | HTML 5 |
19404 | desktop.arcgis.com | 22632 | 4.86 | 200 | HTML 5, No Lang |
19405 | kristenarnett.com | 22633 | 4.86 | 200 | HTML 5, English |
19406 | dart.fss.or.kr | 22634 | 4.86 | 200 | HTML 5 |
19407 | pella.com | 22635 | 4.86 | 200 | HTML 5, English |
19408 | aptbasilicata.it | 22636 | 4.86 | 200 | HTML 5 |
19409 | wiki.lyrasis.org | 22637 | 4.86 | 200 | HTML 5, English |
19410 | johndcook.com | 22638 | 4.86 | 200 | HTML 5, English |
19411 | vagas.com.br | 22639 | 4.86 | 200 | HTML 5 |
19412 | alessioatzeni.com | 22640 | 4.86 | 200 | HTML 5, English |
19413 | farooqkperogi.com | 22641 | 4.86 | 200 | HTML 5, No Lang |
19414 | luisazhou.com | 22644 | 4.86 | 200 | HTML 5, English |
19415 | itthinx.com | 22645 | 4.86 | 200 | HTML 5, English |
19416 | internethalloffame.org | 22647 | 4.86 | 200 | HTML 5, English |
19417 | mynoise.net | 22648 | 4.86 | 200 | HTML 5, English |
19418 | adambrown.info | 22649 | 4.86 | 200 | HTML 5, English |
19419 | toicon.com | 22650 | 4.86 | 200 | HTML 5, English |
19420 | help.hcltechsw.com | 22651 | 4.86 | 200 | HTML 5, English |
19421 | lancasteronline.com | 22652 | 4.86 | 200 | HTML 5, English |
19422 | wisdmlabs.com | 22653 | 4.86 | 200 | HTML 5, English |
19423 | asanet.org | 22655 | 4.86 | 200 | HTML 5, English |
19424 | plumelabs.com | 22656 | 4.86 | 200 | HTML 5, No Lang |
19425 | geoapify.com | 22657 | 4.86 | 200 | HTML 5, English |
19426 | userpilot.com | 22658 | 4.86 | 200 | HTML 5, English |
19427 | sapere.it | 22659 | 4.86 | 200 | HTML 5 |
19428 | nielseniq.com | 22660 | 4.86 | 200 | HTML 5, English |
19429 | wiki.winehq.org | 22661 | 4.86 | 200 | HTML 5, English |
19430 | thunderforest.com | 22662 | 4.86 | 200 | HTML 5, English |
19431 | biola.edu | 22663 | 4.86 | 200 | HTML 5, English |
19432 | centrodedescargas.cnig.es | 22664 | 4.86 | 200 | Transitional |
19433 | cnews.fr | 22665 | 4.86 | 200 | HTML 5 |
19434 | radikal.com.tr | 22666 | 4.86 | 200 | |
19435 | militar.org.ua | 22667 | 4.86 | 200 | HTML 5 |
19436 | bugs.java.com | 22668 | 4.86 | 200 | No Lang |
19437 | aasmnet.org | 22670 | 4.86 | 200 | HTML 5, English |
19438 | marshall.com | 22672 | 4.86 | 200 | HTML 5, English |
19439 | sdn.sap.com | 22673 | 4.86 | 200 | HTML 5, English |
19440 | visiblebody.com | 22674 | 4.86 | 200 | HTML 5, English |
19441 | itunesconnect.apple.com | 22675 | 4.86 | 200 | HTML 5, No Lang |
19442 | tinybuddha.com | 22676 | 4.86 | 200 | HTML 5, English |
19443 | farmersjournal.ie | 22677 | 4.86 | 200 | HTML 5, English |
19444 | nurx.com | 22678 | 4.86 | 200 | HTML 5, English |
19445 | ihs.gov | 22680 | 4.86 | 200 | HTML 5, English |
19446 | note.youdao.com | 22681 | 4.86 | 200 | HTML 5, English |
19447 | natgeotv.com | 22682 | 4.86 | 200 | HTML 5, English |
19448 | haproxy.org | 22683 | 4.86 | 200 | No Lang |
19449 | allscripts.com | 22684 | 4.86 | 200 | HTML 5, English |
19450 | vectorunit.com | 22685 | 4.86 | 200 | HTML 5, English |
19451 | crunchify.com | 22686 | 4.86 | 200 | HTML 5, English |
19452 | security-insider.de | 22687 | 4.86 | 200 | HTML 5 |
19453 | sanskrit-lexicon.uni-koeln.de | 22688 | 4.86 | 200 | HTML 5, No Lang |
19454 | traveliowa.com | 22689 | 4.86 | 200 | HTML 5, English |
19455 | promobil.de | 22691 | 4.86 | 200 | HTML 5 |
19456 | assemblyai.com | 22692 | 4.86 | 200 | HTML 5, English |
19457 | pantip.com | 22693 | 4.86 | 200 | HTML 5 |
19458 | schengenvisainfo.com | 22694 | 4.86 | 200 | HTML 5, English |
19459 | dataliberation.org | 22695 | 4.86 | 200 | HTML 5, English |
19460 | schlager.de | 22696 | 4.86 | 200 | HTML 5 |
19461 | unclaimedbaggage.com | 22697 | 4.86 | 200 | HTML 5, English |
19462 | telerama.fr | 22699 | 4.86 | 200 | HTML 5 |
19463 | bcbg.com | 22700 | 4.86 | 200 | HTML 5, English |
19464 | lyricstranslate.com | 22701 | 4.86 | 200 | HTML 5, English |
19465 | alamedaca.gov | 22702 | 4.86 | 200 | HTML 5, English |
19466 | business-review.eu | 22703 | 4.86 | 200 | HTML 5 |
19467 | futuremedicine.com | 22705 | 4.86 | 200 | HTML 5, No Lang |
19468 | perlmonks.org | 22706 | 4.86 | 200 | English, Transitional |
19469 | manos.malihu.gr | 22707 | 4.86 | 200 | HTML 5, English |
19470 | filfre.net | 22708 | 4.86 | 200 | HTML 5, English |
19471 | qiskit.org | 22709 | 4.86 | 200 | HTML 5, English |
19472 | mactech.com | 22710 | 4.86 | 200 | HTML 5, English |
19473 | gjsentinel.com | 22712 | 4.86 | 200 | HTML 5, English |
19474 | postaffiliatepro.com | 22713 | 4.86 | 200 | HTML 5, English |
19475 | rudderstack.com | 22715 | 4.86 | 200 | HTML 5, English |
19476 | irp.cdn-website.com | 22716 | 4.86 | 200 | No Lang |
19477 | deckerweb.de | 22717 | 4.86 | 200 | Transitional |
19478 | laopinion.com | 22718 | 4.86 | 200 | HTML 5 |
19479 | teachingamericanhistory.org | 22719 | 4.86 | 200 | HTML 5, English |
19480 | bathnes.gov.uk | 22720 | 4.86 | 200 | HTML 5, English |
19481 | shop.fender.com | 22721 | 4.86 | 200 | English |
19482 | harney.com | 22722 | 4.86 | 200 | HTML 5, English |
19483 | evgo.com | 22723 | 4.86 | 200 | HTML 5, English |
19484 | doe.mass.edu | 22724 | 4.86 | 200 | English |
19485 | insights.dice.com | 22725 | 4.86 | 200 | HTML 5, English |
19486 | playwright.dev | 22726 | 4.86 | 200 | HTML 5, English |
19487 | richarddawkins.net | 22727 | 4.86 | 200 | HTML 5, English |
19488 | socialstudies.org | 22728 | 4.86 | 200 | No Lang |
19489 | naoko.blog | 22729 | 4.86 | 200 | HTML 5, English |
19490 | maps.google.ru | 22730 | 4.86 | 200 | HTML 5, English |
19491 | turismo.ra.it | 22731 | 4.86 | 200 | HTML 5 |
19492 | paulrobertlloyd.com | 22732 | 4.86 | 200 | HTML 5, English |
19493 | edgemedianetwork.com | 22733 | 4.86 | 200 | HTML 5, English |
19494 | southpole.com | 22734 | 4.86 | 200 | HTML 5, English |
19495 | webgilde.com | 22735 | 4.86 | 200 | HTML 5, English |
19496 | utsouthwestern.edu | 22736 | 4.86 | 200 | HTML 5, English |
19497 | canarymedia.com | 22737 | 4.86 | 200 | HTML 5, English |
19498 | universia.es | 22738 | 4.86 | 200 | No Lang |
19499 | upplandsvasby.se | 22739 | 4.86 | 200 | HTML 5 |
19500 | krafton.com | 22740 | 4.86 | 200 | HTML 5 |
Data from: Open PageRank