Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and is not in the list!
- It doesn't get much traffic, but outside the top 10M??
- Good performance test of a database table with 10M rows
- Cold loads take several seconds when the table is paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running on SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page of the first 1M domains.
- Interested in stats on the use of HTML 5, valid HTML, etc.
- Started with the first 100K sites and expanding to the first 1M.
- Observations
- Surprising number of domains without a proper html lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, without a fully qualified hostname such as www.
- Many domains don't have a DNS entry for the naked domain.
- e.g. domain.com vs www.domain.com, where the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Some domains serve the redirect behind an invalid SSL cert
- This is super easy to fix, or to park the domain so it handles redirects.
- That is lost revenue for sites with enough inbound links to make this list
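
The doctype and lang observations above could be checked with something like the following minimal sketch. The crawler's actual rules are not described here, so the flag names and the logic behind them are assumptions inferred from the Flags column in the table below. `HomePageProbe` and `classify` are hypothetical names, and only the Python standard library is used.

```python
from html.parser import HTMLParser


class HomePageProbe(HTMLParser):
    """Collects the doctype declaration and the <html lang> attribute."""

    def __init__(self):
        super().__init__()
        self.doctype = None  # text of the <!DOCTYPE ...> declaration, if any
        self.lang = None     # value of <html lang="...">, if any

    def handle_decl(self, decl):
        # decl is the declaration text without the surrounding "<!" and ">"
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.split())

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lowercased from the parser
        if tag == "html" and self.lang is None:
            self.lang = dict(attrs).get("lang")


def classify(html_text):
    """Return flags for a home page (assumed rules, not the author's)."""
    probe = HomePageProbe()
    probe.feed(html_text)
    probe.close()

    doctype = (probe.doctype or "").lower()
    flags = []
    if doctype == "doctype html":
        flags.append("HTML 5")
    if not probe.lang:
        flags.append("No Lang")
    elif probe.lang.lower().startswith("en"):
        flags.append("English")
    if "transitional" in doctype:
        flags.append("Transitional")
    return flags


if __name__ == "__main__":
    print(classify('<!DOCTYPE html><html lang="en-US"><body></body></html>'))
    print(classify("<html><body>no doctype, no lang</body></html>"))
```

Under these assumed rules, a page with a non-English lang value gets no language flag at all, which would match rows that show only "HTML 5".
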
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18501 | aintitcool.com | 21557 | 4.87 | 200 | HTML 5, English |
18502 | anime-expo.org | 21558 | 4.87 | 200 | HTML 5, English |
18503 | milliyet.com.tr | 21559 | 4.87 | 200 | HTML 5 |
18504 | drevo-info.ru | 21560 | 4.87 | 200 | HTML 5, No Lang |
18505 | dcs.warwick.ac.uk | 21561 | 4.87 | 200 | HTML 5, English |
18506 | gallery.menalto.com | 21562 | 4.87 | 200 | No Lang, Transitional |
18507 | source.opennews.org | 21563 | 4.87 | 200 | HTML 5, English |
18508 | ohwr.org | 21565 | 4.87 | 200 | HTML 5, English |
18509 | springboard.com | 21566 | 4.87 | 200 | HTML 5, English |
18510 | mefeedia.com | 21567 | 4.87 | 200 | HTML 5, No Lang |
18511 | hobbyking.com | 21568 | 4.87 | 200 | HTML 5, English |
18512 | zencastr.com | 21569 | 4.87 | 200 | HTML 5, English |
18513 | thegrio.com | 21570 | 4.87 | 200 | HTML 5, English |
18514 | goldplugins.com | 21571 | 4.87 | 200 | HTML 5, English |
18515 | goaffpro.com | 21572 | 4.87 | 200 | HTML 5, No Lang |
18516 | bruehl.de | 21573 | 4.87 | 200 | HTML 5 |
18517 | projects.csail.mit.edu | 21574 | 4.87 | 200 | HTML 5, English |
18518 | hpbn.co | 21575 | 4.87 | 200 | No Lang, Transitional |
18519 | geojson.io | 21576 | 4.87 | 200 | HTML 5, No Lang |
18520 | whitmanarchive.org | 21579 | 4.87 | 200 | HTML 5, English |
18521 | ibdb.com | 21580 | 4.87 | 200 | HTML 5, English |
18522 | prosite.expasy.org | 21581 | 4.87 | 200 | HTML 5, English |
18523 | secunet.com | 21582 | 4.87 | 200 | HTML 5 |
18524 | hattrick.org | 21583 | 4.87 | 200 | English, Transitional |
18525 | wcoomd.org | 21584 | 4.87 | 200 | HTML 5, English |
18526 | nec.com | 21586 | 4.87 | 200 | HTML 5, English |
18527 | pcg-random.org | 21587 | 4.87 | 200 | HTML 5, English |
18528 | diethood.com | 21588 | 4.87 | 200 | HTML 5, English |
18529 | slj.com | 21589 | 4.87 | 200 | No Lang, Transitional |
18530 | hrsonline.org | 21590 | 4.87 | 200 | HTML 5, English |
18531 | directferries.co.uk | 21591 | 4.87 | 200 | HTML 5, English |
18532 | csschopper.com | 21593 | 4.87 | 200 | HTML 5, English |
18533 | batimes.com.ar | 21594 | 4.87 | 200 | HTML 5, English |
18534 | klab.com | 21595 | 4.87 | 200 | HTML 5 |
18535 | catholicherald.co.uk | 21596 | 4.87 | 200 | HTML 5, English |
18536 | futuristarchitecture.com | 21597 | 4.87 | 200 | HTML 5, English |
18537 | auvergnerhonealpes.fr | 21598 | 4.87 | 200 | HTML 5 |
18538 | pima.bibliocommons.com | 21599 | 4.87 | 200 | English |
18539 | disneyanimation.com | 21600 | 4.87 | 200 | HTML 5, English |
18540 | freiepresse.de | 21601 | 4.87 | 200 | HTML 5 |
18541 | misc0110.net | 21602 | 4.87 | 200 | HTML 5, English |
18542 | aligntech.com | 21603 | 4.87 | 200 | HTML 5, No Lang |
18543 | quill.p3k.io | 21604 | 4.87 | 200 | HTML 5, English |
18544 | airnewzealand.co.nz | 21607 | 4.87 | 200 | HTML 5, English |
18545 | centerforhealthsecurity.org | 21608 | 4.87 | 200 | HTML 5, English |
18546 | clariontech.com | 21610 | 4.87 | 200 | HTML 5, English |
18547 | plugin-planet.com | 21611 | 4.87 | 200 | HTML 5, English |
18548 | google.co.ve | 21612 | 4.87 | 200 | HTML 5, English |
18549 | daisy.org | 21613 | 4.87 | 200 | HTML 5, English |
18550 | hive.apache.org | 21614 | 4.87 | 200 | HTML 5, No Lang |
18551 | mlir.llvm.org | 21616 | 4.87 | 200 | HTML 5, English |
18552 | educative.io | 21617 | 4.87 | 200 | HTML 5, English |
18553 | primo.getty.edu | 21618 | 4.87 | 200 | HTML 5, English |
18554 | cancergenome.nih.gov | 21620 | 4.87 | 200 | HTML 5, English |
18555 | ukad.org.uk | 21621 | 4.87 | 200 | HTML 5, English |
18556 | dremio.com | 21622 | 4.87 | 200 | HTML 5, English |
18557 | srtm.csi.cgiar.org | 21623 | 4.87 | 200 | HTML 5, English |
18558 | bleague.jp | 21624 | 4.87 | 200 | HTML 5 |
18559 | journal.classiccars.com | 21625 | 4.87 | 200 | HTML 5, English |
18560 | smartify.org | 21626 | 4.87 | 200 | HTML 5, English |
18561 | esb.ie | 21627 | 4.87 | 200 | HTML 5, English |
18562 | chiefmartec.com | 21628 | 4.87 | 200 | HTML 5, English |
18563 | itasoftware.com | 21629 | 4.87 | 200 | HTML 5, English |
18564 | monthlyreview.org | 21630 | 4.87 | 200 | HTML 5, English |
18565 | gspp.berkeley.edu | 21631 | 4.87 | 200 | HTML 5, English |
18566 | theincidentaleconomist.com | 21632 | 4.87 | 200 | HTML 5, No Lang |
18567 | georgerrmartin.com | 21633 | 4.87 | 200 | No Lang, Transitional |
18568 | russian.rt.com | 21634 | 4.87 | 200 | HTML 5 |
18569 | smartsupp.com | 21636 | 4.87 | 200 | HTML 5, English |
18570 | codesnippets.pro | 21637 | 4.87 | 200 | HTML 5, English |
18571 | lostechies.com | 21638 | 4.87 | 200 | HTML 5, English |
18572 | kokoanalytics.com | 21639 | 4.87 | 200 | HTML 5, English |
18573 | stat.ethz.ch | 21642 | 4.87 | 200 | HTML 5, English |
18574 | kuwaitairways.com | 21643 | 4.87 | 200 | English |
18575 | twinfinite.net | 21644 | 4.87 | 200 | HTML 5, English |
18576 | cartoonbrew.com | 21645 | 4.87 | 200 | HTML 5, English |
18577 | tooter.in | 21646 | 4.87 | 200 | HTML 5, English |
18578 | orcadian.co.uk | 21647 | 4.87 | 200 | HTML 5, English |
18579 | informahealthcare.com | 21648 | 4.87 | 200 | HTML 5, English |
18580 | aleagues.com.au | 21649 | 4.87 | 200 | HTML 5, English |
18581 | skyguide.ch | 21650 | 4.87 | 200 | HTML 5, English |
18582 | cemetech.net | 21651 | 4.87 | 200 | HTML 5, English |
18583 | inspection.canada.ca | 21652 | 4.87 | 200 | HTML 5, No Lang |
18584 | pic.twitter.com | 21653 | 4.87 | 200 | No Lang |
18585 | fi.wordpress.org | 21654 | 4.87 | 200 | HTML 5 |
18586 | emojitracker.com | 21655 | 4.87 | 200 | HTML 5, No Lang |
18587 | wyden.senate.gov | 21656 | 4.87 | 200 | HTML 5, English |
18588 | proto.io | 21657 | 4.87 | 200 | HTML 5, English |
18589 | handy.com | 21658 | 4.87 | 200 | HTML 5, English |
18590 | medium.datadriveninvestor.com | 21659 | 4.87 | 200 | HTML 5, No Lang |
18591 | billhartzer.com | 21661 | 4.87 | 200 | HTML 5, English |
18592 | foundation.wikimedia.org | 21663 | 4.87 | 200 | HTML 5, No Lang |
18593 | addevent.com | 21664 | 4.87 | 200 | HTML 5, English |
18594 | unimed.coop.br | 21665 | 4.87 | 200 | HTML 5 |
18595 | denver.cbslocal.com | 21666 | 4.87 | 200 | HTML 5, English |
18596 | land.copernicus.eu | 21667 | 4.87 | 200 | HTML 5, English |
18597 | monese.com | 21668 | 4.87 | 200 | HTML 5, English |
18598 | cloud.yandex.ru | 21671 | 4.87 | 200 | No Lang |
18599 | elexon.co.uk | 21672 | 4.87 | 200 | HTML 5, English |
18600 | ecologi.com | 21674 | 4.87 | 200 | HTML 5, English |
Data from: Open PageRank
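
For the stats on HTML 5 and lang usage, the Flags column of a table like the one above can be tallied with a short script. This is a minimal sketch that assumes the rows are available as pipe-delimited text lines in the same layout; `tally_flags` is a hypothetical helper, not part of the crawler.

```python
from collections import Counter


def tally_flags(rows):
    """Count each flag across pipe-delimited table rows."""
    counts = Counter()
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        # Skip the header, the separator line, and malformed rows
        if len(cells) < 6 or not cells[0].isdigit():
            continue
        counts.update(f.strip() for f in cells[5].split(",") if f.strip())
    return counts


if __name__ == "__main__":
    sample = [
        "# | Domain | Sort | Rank | Status | Flags |",
        "---|---|---|---|---|---|",
        "18501 | aintitcool.com | 21557 | 4.87 | 200 | HTML 5, English |",
        "18504 | drevo-info.ru | 21560 | 4.87 | 200 | HTML 5, No Lang |",
        "18506 | gallery.menalto.com | 21562 | 4.87 | 200 | No Lang, Transitional |",
    ]
    for flag, n in tally_flags(sample).most_common():
        print(f"{flag}: {n}")
```
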