Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and is not in the list!
- It doesn't get much traffic, but ranked below 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on the use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed as naked domains, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where one should redirect to the other.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Some domains serve their redirect behind an invalid SSL certificate
- This is super easy to fix, or to park the domain so it handles redirects.
- Lost revenue, given how much linking it takes to get on this list
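
The naked-domain observations above can be reproduced with a small probe. This is a minimal sketch and not the crawler used for this report: the function names `candidate_urls` and `probe` are hypothetical, it uses only the Python standard library, and it keeps default certificate verification on so an invalid certificate shows up as an error instead of a silent redirect. Prepending `www.` is only meaningful for registrable domains, not for entries that are already subdomains.

```python
import ssl
import urllib.error
import urllib.request


def candidate_urls(domain: str) -> list[str]:
    """Return the naked-domain URL first, then the www variant if applicable."""
    domain = domain.strip().lower().rstrip(".")
    urls = [f"https://{domain}/"]
    if not domain.startswith("www."):
        urls.append(f"https://www.{domain}/")
    return urls


def probe(url: str, timeout: float = 10.0) -> str:
    """Fetch url with default SSL verification and summarize the outcome."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            # urlopen follows redirects, so geturl() is the final URL reached.
            return f"{resp.status} {resp.geturl()}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (4xx/5xx).
        return f"{exc.code} {url}"
    except urllib.error.URLError as exc:
        # Certificate failures arrive wrapped in URLError.reason.
        if isinstance(exc.reason, ssl.SSLError):
            return f"ssl-error {url}"
        return f"unreachable {url}"
    except OSError:
        # Timeouts and connection resets that occur outside urlopen's wrapper.
        return f"unreachable {url}"


if __name__ == "__main__":
    for url in candidate_urls("example.com"):
        print(probe(url))
```

A domain whose naked URL reports `ssl-error` while the `www` URL returns `200` is the case described above: a client with standard checks never reaches the redirect.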
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
8601 | jakearchibald.com | 10033 | 5.09 | 200 | HTML 5, English |
8602 | fridaysforfuture.org | 10034 | 5.09 | 200 | HTML 5, English |
8603 | harley-davidson.com | 10035 | 5.09 | 200 | HTML 5, English |
8604 | seclists.org | 10036 | 5.09 | 200 | HTML 5, English |
8605 | arabianbusiness.com | 10037 | 5.09 | 200 | HTML 5, English |
8606 | global.canon | 10040 | 5.09 | 200 | HTML 5, English |
8607 | finance.google.com | 10041 | 5.09 | 200 | HTML 5, English |
8608 | googlecode.blogspot.com | 10042 | 5.09 | 200 | HTML 5, No Lang |
8609 | donga.com | 10043 | 5.09 | 200 | HTML 5 |
8610 | perfectketo.com | 10045 | 5.09 | 200 | HTML 5, English |
8611 | ya.ru | 10046 | 5.09 | 200 | HTML 5 |
8612 | pendleton-usa.com | 10047 | 5.09 | 200 | HTML 5, English |
8613 | ngm.nationalgeographic.com | 10048 | 5.09 | 200 | HTML 5, English |
8614 | idpay.ir | 10049 | 5.09 | 200 | HTML 5 |
8615 | bcs.org | 10050 | 5.09 | 200 | HTML 5, English |
8616 | blog.taragana.com | 10051 | 5.09 | 200 | HTML 5, English |
8617 | gov.si | 10052 | 5.09 | 200 | HTML 5 |
8618 | wiki.videolan.org | 10055 | 5.09 | 200 | HTML 5, English |
8619 | athenahealth.com | 10056 | 5.09 | 200 | HTML 5, English |
8620 | college-de-france.fr | 10057 | 5.09 | 200 | HTML 5, English |
8621 | muenchen.de | 10058 | 5.09 | 200 | HTML 5 |
8622 | ooni.org | 10059 | 5.09 | 200 | HTML 5, English |
8623 | us.shein.com | 10060 | 5.09 | 200 | HTML 5, English |
8624 | thesartorialist.com | 10061 | 5.09 | 200 | HTML 5, English |
8625 | oreillynet.com | 10062 | 5.09 | 200 | HTML 5, English |
8626 | lens.org | 10063 | 5.09 | 200 | HTML 5, English |
8627 | bench.co | 10064 | 5.09 | 200 | HTML 5, English |
8628 | creditcards.com | 10065 | 5.09 | 200 | HTML 5, English |
8629 | case-mate.com | 10066 | 5.09 | 200 | HTML 5, English |
8630 | 80000hours.org | 10067 | 5.09 | 200 | HTML 5, English |
8631 | beinsports.com | 10068 | 5.09 | 200 | HTML 5, English |
8632 | straight.com | 10069 | 5.09 | 200 | HTML 5, English |
8633 | theora.org | 10070 | 5.09 | 200 | No Lang, Strict |
8634 | hvv.de | 10071 | 5.09 | 200 | HTML 5 |
8635 | unssc.org | 10072 | 5.09 | 200 | HTML 5, English |
8636 | vanguard.com | 10074 | 5.09 | 200 | HTML 5, English |
8637 | deepmind.google | 10075 | 5.09 | 200 | HTML 5, English |
8638 | chinadigitaltimes.net | 10076 | 5.09 | 200 | HTML 5, English |
8639 | tieba.baidu.com | 10077 | 5.09 | 200 | HTML 5, No Lang |
8640 | en.uesp.net | 10078 | 5.09 | 200 | HTML 5, English |
8641 | investigationdiscovery.com | 10079 | 5.09 | 200 | HTML 5, English |
8642 | icj-cij.org | 10081 | 5.09 | 200 | HTML 5, English |
8643 | meta.trac.wordpress.org | 10083 | 5.09 | 200 | No Lang, Strict |
8644 | ca.finance.yahoo.com | 10084 | 5.09 | 200 | HTML 5, English |
8645 | rnd.de | 10085 | 5.09 | 200 | HTML 5 |
8646 | support.kaspersky.com | 10086 | 5.09 | 200 | HTML 5, English |
8647 | thezoereport.com | 10087 | 5.09 | 200 | HTML 5, English |
8648 | oswego.edu | 10088 | 5.09 | 200 | HTML 5, English |
8649 | openstack.org | 10089 | 5.09 | 200 | HTML 5, No Lang |
8650 | toronto.citynews.ca | 10090 | 5.09 | 200 | HTML 5, English |
8651 | typography.com | 10091 | 5.09 | 200 | HTML 5, English |
8652 | gds.blog.gov.uk | 10092 | 5.09 | 200 | HTML 5, English |
8653 | cbd.int | 10093 | 5.09 | 200 | HTML 5, English |
8654 | biologicaldiversity.org | 10094 | 5.09 | 200 | HTML 5, English |
8655 | bigcartel.com | 10095 | 5.09 | 200 | HTML 5, English |
8656 | userlike.com | 10096 | 5.09 | 200 | HTML 5, No Lang |
8657 | darc.de | 10097 | 5.09 | 200 | HTML 5 |
8658 | crayola.com | 10098 | 5.09 | 200 | No Lang |
8659 | fanyi.baidu.com | 10099 | 5.09 | 200 | HTML 5, No Lang |
8660 | harrypotter.wikia.com | 10100 | 5.09 | 200 | HTML 5, English |
8661 | crave.ca | 10101 | 5.09 | 200 | HTML 5, English |
8662 | leipzig.de | 10102 | 5.09 | 200 | HTML 5 |
8663 | codeplex.com | 10103 | 5.09 | 200 | HTML 5, English |
8664 | aoa.org | 10104 | 5.09 | 200 | HTML 5, English |
8665 | sheetmusicplus.com | 10106 | 5.09 | 200 | HTML 5, English |
8666 | pubdocs.worldbank.org | 10107 | 5.09 | 200 | No Lang, Transitional |
8667 | fiercepharma.com | 10108 | 5.09 | 200 | HTML 5, English |
8668 | dotcom-monitor.com | 10110 | 5.09 | 200 | HTML 5, English |
8669 | comingsoon.it | 10111 | 5.09 | 200 | HTML 5 |
8670 | usaspending.gov | 10112 | 5.09 | 200 | HTML 5, English |
8671 | autotrader.com | 10113 | 5.09 | 200 | HTML 5, English |
8672 | yithemes.com | 10114 | 5.09 | 200 | HTML 5, English |
8673 | optimizepress.com | 10117 | 5.09 | 200 | HTML 5, English |
8674 | cs.helsinki.fi | 10118 | 5.09 | 200 | HTML 5, English |
8675 | note.mu | 10119 | 5.09 | 200 | HTML 5 |
8676 | medium.freecodecamp.com | 10120 | 5.09 | 200 | HTML 5, English |
8677 | ncsasports.org | 10121 | 5.09 | 200 | HTML 5, English |
8678 | justataste.com | 10122 | 5.09 | 200 | HTML 5, English |
8679 | sourceware.org | 10123 | 5.09 | 200 | No Lang |
8680 | president.gov.ua | 10124 | 5.09 | 200 | HTML 5 |
8681 | truthforlife.org | 10126 | 5.09 | 200 | HTML 5, English |
8682 | virginia.edu | 10127 | 5.09 | 200 | HTML 5, English |
8683 | kb.vmware.com | 10128 | 5.09 | 200 | HTML 5, English |
8684 | adb.org | 10129 | 5.09 | 200 | HTML 5, English |
8685 | insights.stackoverflow.com | 10130 | 5.09 | 200 | HTML 5, No Lang |
8686 | enoughproject.org | 10131 | 5.09 | 200 | HTML 5, English |
8687 | mbank.pl | 10132 | 5.09 | 200 | HTML 5 |
8688 | ftp.iza.org | 10133 | 5.09 | 200 | No Lang |
8689 | ntu.edu.sg | 10134 | 5.09 | 200 | HTML 5, English |
8690 | postmansmtp.com | 10135 | 5.09 | 200 | HTML 5, English |
8691 | valentino.com | 10136 | 5.09 | 200 | HTML 5, English |
8692 | tudelft.nl | 10137 | 5.09 | 200 | HTML 5 |
8693 | forum.ghost.org | 10138 | 5.09 | 200 | HTML 5, English |
8694 | git.sr.ht | 10139 | 5.09 | 200 | HTML 5, No Lang |
8695 | wspa.com | 10140 | 5.09 | 200 | HTML 5, English |
8696 | inquisitr.com | 10142 | 5.09 | 200 | HTML 5, English |
8697 | leatherman.com | 10143 | 5.09 | 200 | HTML 5, English |
8698 | online.flippingbook.com | 10144 | 5.09 | 200 | HTML 5, English |
8699 | wonderkind.de | 10145 | 5.09 | 200 | HTML 5, English |
8700 | givingtuesday.org | 10146 | 5.09 | 200 | HTML 5, English |
Data from: Open PageRank
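
The Flags column in the table could be derived from each home page roughly as follows. This is a guess at the logic, not the code behind this report: the function name `classify`, the regular expressions, and the 4096-character window are all assumptions, and no attempt is made to match the flag ordering shown in the table. A real crawler would more likely use an HTML parser than regular expressions.

```python
import re

# Assumed patterns; they only inspect the start of the document.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
LANG_RE = re.compile(r"""(?:^|\s)lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return doctype and language flags for a page's HTML source."""
    flags = []
    head = html[:4096]  # the doctype and <html> tag should appear early

    doctype = DOCTYPE_RE.search(head)
    if doctype:
        decl = doctype.group(1).strip().lower()
        if decl == "html":
            flags.append("HTML 5")
        elif "strict" in decl:
            flags.append("Strict")
        elif "transitional" in decl:
            flags.append("Transitional")

    tag = HTML_TAG_RE.search(head)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().split("-")[0] == "en":
        flags.append("English")
    # Other languages get no language flag, as in the table rows with only "HTML 5".
    return flags


if __name__ == "__main__":
    print(classify('<!DOCTYPE html><html lang="en-US"><head>'))
```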