Table filled in from the top 10M most popular websites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but ranked below 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where one should redirect to the other.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
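The full table scan suspicion noted above can be checked by reading the query plan before and after an index exists. The sketch below uses SQLite from the Python standard library as a stand-in, not the SQL Server or PostgreSQL setups mentioned in the notes. The table name `domains`, its columns, and the index name are hypothetical, and the 10,000 generated rows are placeholder data rather than the real list.

```python
import sqlite3


def plan_details(conn, sql, params=()):
    """Return the human-readable 'detail' column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return [row[3] for row in rows]


# Hypothetical schema, loosely modelled on the columns of the report table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE domains ("
    "id INTEGER PRIMARY KEY, domain TEXT, sort_order INTEGER, page_rank REAL)"
)
conn.executemany(
    "INSERT INTO domains (domain, sort_order, page_rank) VALUES (?, ?, ?)",
    ((f"site{i}.example", i, 10.0 - i / 10000) for i in range(1, 10001)),
)

# One page of the report: a range on the sort column, returned in order.
page_query = (
    "SELECT id, domain, page_rank FROM domains "
    "WHERE sort_order BETWEEN ? AND ? ORDER BY sort_order"
)

# Without an index on sort_order the planner has to read every row.
before = plan_details(conn, page_query, (6593, 6702))

# With the index the same query becomes a range search.
conn.execute("CREATE INDEX ix_domains_sort_order ON domains (sort_order)")
after = plan_details(conn, page_query, (6593, 6702))

print("before:", before)
print("after: ", after)
```

The exact plan wording differs between SQLite versions, so the useful signal is only whether the plan says SCAN or SEARCH. SQL Server and PostgreSQL expose the same information through their own plan output, which is where a scan on the 10M row table would show up.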
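The doctype and lang observations can be turned into flags like the ones in the report table with a small parser. This is a minimal sketch using Python's standard `html.parser`, not the crawler actually used here. The flag names are copied from the Flags column, but the rules behind them are guesses: "English" is assumed to mean a lang value starting with `en`, other languages are assumed to produce no language flag, and the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class HomePageFlags(HTMLParser):
    """Collect the doctype declaration and the lang attribute of <html>."""

    def __init__(self):
        super().__init__()
        self.doctype = None
        self.lang = None
        self.seen_html = False

    def handle_decl(self, decl):
        # Called with the text inside <! ... >, e.g. "DOCTYPE html".
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.split()).lower()

    def handle_starttag(self, tag, attrs):
        # Only the first <html> start tag counts.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = (dict(attrs).get("lang") or "").strip() or None


def classify(html_text):
    """Return flags in the order they appear in the report table."""
    parser = HomePageFlags()
    parser.feed(html_text)
    parser.close()

    flags = []
    doctype = parser.doctype or ""
    if doctype == "doctype html":
        flags.append("HTML 5")

    if parser.lang is None:
        flags.append("No Lang")
    elif parser.lang.lower().startswith("en"):
        flags.append("English")

    # Legacy doctypes are recognised by their public identifier or DTD URL.
    if "transitional" in doctype:
        flags.append("Transitional")
    elif "strict" in doctype:
        flags.append("Strict")
    return flags


print(classify('<!DOCTYPE html><html lang="en-US"><head></head></html>'))
print(classify("<!doctype html><html><body></body></html>"))
```

Running the parser over only the first few kilobytes of each home page is usually enough, since both the doctype and the `<html>` tag sit at the top of the document.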
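The naked domain problem can be probed by trying the apex host first and the www host second, with certificate verification left on. The sketch below is one possible way to do that with Python's standard library, not the crawler used for this report. The function names and result labels are hypothetical, and `example.com` in the usage note is a placeholder.

```python
import ssl
import urllib.error
import urllib.request


def candidate_urls(domain):
    """HTTPS URLs to try for a listed domain: the name as listed, then www."""
    host = domain.strip().lower().rstrip(".")
    hosts = [host] if host.startswith("www.") else [host, "www." + host]
    return ["https://" + h + "/" for h in hosts]


def probe(url, timeout=10.0):
    """Fetch url with standard certificate checks and report what happened.

    Returns (outcome, status, final_url). Redirects are followed by urlopen,
    so final_url shows where a working redirect ends up.
    """
    context = ssl.create_default_context()  # verification stays enabled
    try:
        with urllib.request.urlopen(url, timeout=timeout, context=context) as resp:
            return ("ok", resp.status, resp.geturl())
    except urllib.error.HTTPError as exc:
        # The TLS handshake worked; the server answered with an error status.
        return ("http_error", exc.code, url)
    except urllib.error.URLError as exc:
        # A bad certificate fails here, before any redirect can be seen.
        if isinstance(exc.reason, ssl.SSLError):
            return ("ssl_error", None, url)
        return ("unreachable", None, url)
    except OSError:
        # Timeouts and connection resets raised outside urlopen's wrapper.
        return ("unreachable", None, url)


def check_domain(domain):
    """Probe every candidate URL for one domain."""
    return [probe(url) for url in candidate_urls(domain)]
```

Calling `check_domain("example.com")` returns one result per candidate URL. A domain where the first entry is `ssl_error` and the second is `ok` matches the case described above: the redirect exists, but a client with standard security checks never reaches it.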
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
5601 | phoronix.com | 6593 | 5.25 | 200 | HTML 5, English |
5602 | broadinstitute.org | 6594 | 5.25 | 200 | HTML 5, English |
5603 | lacma.org | 6595 | 5.25 | 200 | HTML 5, English |
5604 | zdnet.fr | 6596 | 5.25 | 200 | HTML 5 |
5605 | tw.news.yahoo.com | 6597 | 5.25 | 200 | HTML 5 |
5606 | bloomingdales.com | 6598 | 5.25 | 200 | HTML 5, English |
5607 | nyti.ms | 6599 | 5.25 | 200 | HTML 5, English |
5608 | ftp.gnome.org | 6600 | 5.25 | 200 | HTML 5, No Lang |
5609 | jiosaavn.com | 6601 | 5.25 | 200 | HTML 5, English |
5610 | afb.org | 6603 | 5.25 | 200 | HTML 5, English |
5611 | unige.ch | 6604 | 5.25 | 200 | HTML 5 |
5612 | zooniverse.org | 6605 | 5.25 | 200 | HTML 5, English |
5613 | swisscom.ch | 6606 | 5.25 | 200 | HTML 5, English |
5614 | apps.irs.gov | 6607 | 5.25 | 200 | HTML 5, English |
5615 | zenwriting.net | 6608 | 5.25 | 200 | HTML 5, English |
5616 | espressif.com | 6609 | 5.25 | 200 | English |
5617 | barchart.com | 6610 | 5.25 | 200 | HTML 5, English |
5618 | pastelink.net | 6611 | 5.25 | 200 | HTML 5, English |
5619 | world.openfoodfacts.org | 6612 | 5.25 | 200 | English |
5620 | chron.com | 6613 | 5.25 | 200 | HTML 5, English |
5621 | docs.newrelic.com | 6614 | 5.25 | 200 | HTML 5, English |
5622 | useloom.com | 6615 | 5.25 | 200 | HTML 5, English |
5623 | copernicus.eu | 6616 | 5.25 | 200 | HTML 5, English |
5624 | oaic.gov.au | 6617 | 5.25 | 200 | HTML 5, English |
5625 | goqr.me | 6619 | 5.25 | 200 | HTML 5, No Lang |
5626 | blog.taxjar.com | 6620 | 5.25 | 200 | HTML 5, English |
5627 | hidive.com | 6621 | 5.25 | 200 | HTML 5, No Lang |
5628 | starz.com | 6623 | 5.25 | 200 | HTML 5, English |
5629 | harman.com | 6624 | 5.25 | 200 | HTML 5, English |
5630 | reviews.io | 6625 | 5.25 | 200 | HTML 5, English |
5631 | uk.pinterest.com | 6626 | 5.25 | 200 | HTML 5, English |
5632 | siteresources.worldbank.org | 6627 | 5.25 | 200 | No Lang, Transitional |
5633 | ocf.berkeley.edu | 6628 | 5.25 | 200 | HTML 5, No Lang |
5634 | isc.org | 6629 | 5.25 | 200 | HTML 5, English |
5635 | qwant.com | 6631 | 5.25 | 200 | HTML 5, English |
5636 | courthousenews.com | 6632 | 5.25 | 200 | HTML 5, English |
5637 | radiofrance.fr | 6633 | 5.25 | 200 | HTML 5 |
5638 | scirp.org | 6634 | 5.25 | 200 | No Lang, Transitional |
5639 | data.gov.hk | 6635 | 5.25 | 200 | HTML 5, English |
5640 | setmore.com | 6636 | 5.25 | 200 | HTML 5, English |
5641 | gamasutra.com | 6637 | 5.25 | 200 | HTML 5, English |
5642 | yr.no | 6638 | 5.25 | 200 | HTML 5 |
5643 | mailtrap.io | 6639 | 5.25 | 200 | HTML 5, English |
5644 | delfi.lt | 6640 | 5.25 | 200 | HTML 5 |
5645 | tutorialspoint.com | 6643 | 5.25 | 200 | No Lang |
5646 | jpmorganchase.com | 6644 | 5.25 | 200 | HTML 5, English |
5647 | drudgereport.com | 6645 | 5.24 | 200 | No Lang |
5648 | overdrive.com | 6646 | 5.24 | 200 | HTML 5, English |
5649 | die-linke.de | 6647 | 5.24 | 200 | HTML 5 |
5650 | airbrake.io | 6648 | 5.24 | 200 | HTML 5, No Lang |
5651 | obyte.org | 6649 | 5.24 | 200 | HTML 5, English |
5652 | s3.eu-west-2.amazonaws.com | 6650 | 5.24 | 200 | HTML 5, English |
5653 | freesound.org | 6651 | 5.24 | 200 | HTML 5, English |
5654 | route.com | 6652 | 5.24 | 200 | HTML 5, English |
5655 | travelchannel.com | 6653 | 5.24 | 200 | HTML 5, English |
5656 | dmarc.org | 6654 | 5.24 | 200 | HTML 5, English |
5657 | radiotimes.com | 6655 | 5.24 | 200 | HTML 5, English |
5658 | timesofmalta.com | 6656 | 5.24 | 200 | HTML 5, English |
5659 | veeam.com | 6657 | 5.24 | 200 | HTML 5, English |
5660 | wishtv.com | 6658 | 5.24 | 200 | HTML 5, English |
5661 | pressbooks.com | 6660 | 5.24 | 200 | HTML 5, English |
5662 | extension.usu.edu | 6661 | 5.24 | 200 | HTML 5, English |
5663 | melon.com | 6662 | 5.24 | 200 | HTML 5 |
5664 | thetakeout.com | 6663 | 5.24 | 200 | HTML 5, English |
5665 | scholar.google.ca | 6664 | 5.24 | 200 | HTML 5, No Lang |
5666 | columbian.com | 6665 | 5.24 | 200 | HTML 5, English |
5667 | birminghammail.co.uk | 6666 | 5.24 | 200 | HTML 5, English |
5668 | vive.com | 6667 | 5.24 | 200 | HTML 5, English |
5669 | batcon.org | 6668 | 5.24 | 200 | HTML 5, English |
5670 | aetna.com | 6669 | 5.24 | 200 | No Lang |
5671 | groups.io | 6670 | 5.24 | 200 | HTML 5, English |
5672 | almasryalyoum.com | 6671 | 5.24 | 200 | HTML 5, English |
5673 | blockclubchicago.org | 6674 | 5.24 | 200 | HTML 5, English |
5674 | osce.org | 6675 | 5.24 | 200 | English, Strict |
5675 | draugiem.lv | 6676 | 5.24 | 200 | HTML 5 |
5676 | seositecheckup.com | 6677 | 5.24 | 200 | HTML 5, English |
5677 | care.diabetesjournals.org | 6678 | 5.24 | 200 | No Lang |
5678 | macfound.org | 6679 | 5.24 | 200 | HTML 5, English |
5679 | projects.propublica.org | 6680 | 5.24 | 200 | HTML 5, English |
5680 | xamoom.com | 6681 | 5.24 | 200 | HTML 5, English |
5681 | xspf.org | 6682 | 5.24 | 200 | HTML 5, No Lang |
5682 | kingcounty.gov | 6683 | 5.24 | 200 | HTML 5, English |
5683 | getpaint.net | 6684 | 5.24 | 200 | No Lang |
5684 | roli.com | 6685 | 5.24 | 200 | HTML 5, English |
5685 | duke-energy.com | 6686 | 5.24 | 200 | HTML 5, English |
5686 | ncjrs.gov | 6687 | 5.24 | 200 | HTML 5, English |
5687 | es.scribd.com | 6688 | 5.24 | 200 | HTML 5, English |
5688 | healthcare.utah.edu | 6689 | 5.24 | 200 | HTML 5, English |
5689 | dynamics.microsoft.com | 6690 | 5.24 | 200 | HTML 5, English |
5690 | link.space | 6691 | 5.24 | 200 | HTML 5, English |
5691 | mercadolivre.com.br | 6692 | 5.24 | 200 | HTML 5 |
5692 | carbonbrief.org | 6693 | 5.24 | 200 | HTML 5, English |
5693 | indd.adobe.com | 6694 | 5.24 | 200 | HTML 5, English |
5694 | ovhcloud.com | 6695 | 5.24 | 200 | HTML 5, English |
5695 | fox2now.com | 6696 | 5.24 | 200 | HTML 5, English |
5696 | spectrumlocalnews.com | 6697 | 5.24 | 200 | HTML 5, English |
5697 | rocketlawyer.com | 6698 | 5.24 | 200 | HTML 5, English |
5698 | getrichslowly.org | 6700 | 5.24 | 200 | HTML 5, English |
5699 | datanyze.com | 6701 | 5.24 | 200 | HTML 5, English |
5700 | chegg.com | 6702 | 5.24 | 200 | HTML 5, English |
Data from: Open PageRank