Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16801 | alexgorbatchev.com | 19582 | 4.90 | 200 | HTML 5, English |
16802 | pinterest.at | 19583 | 4.90 | 200 | HTML 5, English |
16803 | desertsun.com | 19584 | 4.90 | 200 | HTML 5, English |
16804 | kipa.co.il | 19585 | 4.90 | 200 | HTML 5 |
16805 | veltra.com | 19587 | 4.90 | 200 | English, Transitional |
16806 | tshaonline.org | 19588 | 4.90 | 200 | HTML 5, English |
16807 | math.rutgers.edu | 19589 | 4.90 | 200 | HTML 5, English |
16808 | cs.illinois.edu | 19590 | 4.90 | 200 | HTML 5, English |
16809 | ibm.biz | 19591 | 4.90 | 200 | HTML 5, English |
16810 | nextbigideaclub.com | 19592 | 4.90 | 200 | HTML 5, English |
16811 | nexcess.net | 19593 | 4.90 | 200 | HTML 5, English |
16812 | codeburst.io | 19594 | 4.90 | 200 | HTML 5, No Lang |
16813 | gdz.sub.uni-goettingen.de | 19595 | 4.90 | 200 | HTML 5 |
16814 | gulf-times.com | 19596 | 4.90 | 200 | HTML 5, English |
16815 | endhomelessness.org | 19597 | 4.90 | 200 | HTML 5, English |
16816 | oks.org.rs | 19599 | 4.90 | 200 | HTML 5 |
16817 | remote.com | 19600 | 4.90 | 200 | HTML 5, English |
16818 | elixir-europe.org | 19601 | 4.90 | 200 | HTML 5, English |
16819 | hrsa.gov | 19602 | 4.90 | 200 | HTML 5, English |
16820 | roskilde-festival.dk | 19603 | 4.90 | 200 | HTML 5 |
16821 | chandlerproject.org | 19604 | 4.90 | 200 | English, Strict |
16822 | freshsheetmusic.com | 19605 | 4.90 | 200 | HTML 5, English |
16823 | cep.lse.ac.uk | 19606 | 4.90 | 200 | HTML 5, English |
16824 | gamingdeputy.com | 19607 | 4.90 | 200 | HTML 5, English |
16825 | agner.org | 19608 | 4.90 | 200 | No Lang |
16826 | sensible.com | 19609 | 4.90 | 200 | HTML 5, English |
16827 | delta.app | 19610 | 4.90 | 200 | HTML 5, English |
16828 | wiki.p2pfoundation.net | 19611 | 4.90 | 200 | HTML 5, English |
16829 | braunschweig.de | 19613 | 4.90 | 200 | HTML 5 |
16830 | lists.oasis-open.org | 19614 | 4.90 | 200 | HTML 5, English |
16831 | wwltv.com | 19615 | 4.90 | 200 | HTML 5, English |
16832 | brandmeister.network | 19616 | 4.90 | 200 | HTML 5, No Lang |
16833 | parliament.nsw.gov.au | 19617 | 4.90 | 200 | English, Strict |
16834 | sudouest.fr | 19618 | 4.90 | 200 | HTML 5 |
16835 | octoverse.github.com | 19619 | 4.90 | 200 | HTML 5, English |
16836 | id.ee | 19620 | 4.90 | 200 | HTML 5 |
16837 | fediverse.party | 19621 | 4.90 | 200 | HTML 5, English |
16838 | i.blackhat.com | 19622 | 4.90 | 200 | No Lang |
16839 | competitions.codalab.org | 19623 | 4.90 | 200 | HTML 5, English |
16840 | voicy.jp | 19624 | 4.90 | 200 | HTML 5 |
16841 | de.finance.yahoo.com | 19625 | 4.90 | 200 | HTML 5, No Lang |
16842 | publicnewsservice.org | 19626 | 4.90 | 200 | HTML 5, No Lang |
16843 | gwtproject.org | 19627 | 4.90 | 200 | HTML 5, No Lang |
16844 | fieldandstream.com | 19628 | 4.90 | 200 | HTML 5, English |
16845 | sescsp.org.br | 19630 | 4.90 | 200 | HTML 5 |
16846 | static.flickr.com | 19631 | 4.90 | 200 | No Lang |
16847 | phoenixnewtimes.com | 19632 | 4.90 | 200 | HTML 5, English |
16848 | primariaclujnapoca.ro | 19634 | 4.90 | 200 | HTML 5 |
16849 | foundry.com | 19635 | 4.90 | 200 | HTML 5, English |
16850 | tatamotors.com | 19636 | 4.90 | 200 | HTML 5, English |
16851 | news.rthk.hk | 19637 | 4.90 | 200 | HTML 5 |
16852 | mynet.com | 19638 | 4.90 | 200 | HTML 5 |
16853 | docs.wp-rocket.me | 19639 | 4.90 | 200 | HTML 5, No Lang |
16854 | billetto.co.uk | 19640 | 4.90 | 200 | HTML 5, English |
16855 | bigmarker.com | 19641 | 4.90 | 200 | HTML 5, No Lang |
16856 | indieauth.spec.indieweb.org | 19642 | 4.90 | 200 | HTML 5, English |
16857 | group.bnpparibas | 19645 | 4.90 | 200 | HTML 5 |
16858 | support.authorize.net | 19646 | 4.90 | 200 | HTML 5, English |
16859 | ri.gov | 19648 | 4.90 | 200 | HTML 5, English |
16860 | movabletype.org | 19649 | 4.90 | 200 | HTML 5, English |
16861 | stlcitysc.com | 19650 | 4.90 | 200 | HTML 5, English |
16862 | bible.org | 19652 | 4.90 | 200 | HTML 5, English |
16863 | reviews.llvm.org | 19654 | 4.90 | 200 | HTML 5, English |
16864 | encodeproject.org | 19655 | 4.90 | 200 | HTML 5, English |
16865 | givingwhatwecan.org | 19656 | 4.90 | 200 | HTML 5, English |
16866 | openeurope.org.uk | 19657 | 4.90 | 200 | HTML 5, English |
16867 | mladina.si | 19658 | 4.90 | 200 | HTML 5, No Lang |
16868 | bughunters.google.com | 19659 | 4.90 | 200 | HTML 5, English |
16869 | lifestyleofafoodie.com | 19660 | 4.90 | 200 | HTML 5, English |
16870 | rogervoice.com | 19662 | 4.90 | 200 | HTML 5, English |
16871 | newsoffice.mit.edu | 19663 | 4.90 | 200 | HTML 5, English |
16872 | www3.ntu.edu.sg | 19664 | 4.90 | 200 | No Lang |
16873 | thecollector.com | 19665 | 4.90 | 200 | HTML 5, English |
16874 | mlcommons.org | 19666 | 4.90 | 200 | HTML 5, English |
16875 | en.paperblog.com | 19667 | 4.90 | 200 | English, Strict |
16876 | mspoweruser.com | 19668 | 4.90 | 200 | HTML 5, English |
16877 | moccae.gov.ae | 19669 | 4.90 | 200 | HTML 5 |
16878 | sweetyhigh.com | 19670 | 4.90 | 200 | HTML 5, English |
16879 | pan.baidu.com | 19671 | 4.90 | 200 | HTML 5, No Lang |
16880 | it.slashdot.org | 19672 | 4.90 | 200 | English |
16881 | wescover.com | 19673 | 4.90 | 200 | HTML 5, English |
16882 | people.debian.org | 19674 | 4.90 | 200 | English, Transitional |
16883 | povertyactionlab.org | 19675 | 4.90 | 200 | HTML 5, English |
16884 | modernretail.co | 19676 | 4.90 | 200 | HTML 5, English |
16885 | colopl.co.jp | 19677 | 4.90 | 200 | HTML 5 |
16886 | haxe.org | 19678 | 4.90 | 200 | HTML 5, No Lang |
16887 | radiosvoboda.org | 19679 | 4.90 | 200 | HTML 5 |
16888 | la.eater.com | 19680 | 4.90 | 200 | HTML 5, English |
16889 | fedlex.admin.ch | 19681 | 4.90 | 200 | HTML 5, No Lang |
16890 | tipjunkie.com | 19682 | 4.90 | 200 | HTML 5, English |
16891 | upstreamonline.com | 19683 | 4.90 | 200 | HTML 5, English |
16892 | nationalbankopen.com | 19684 | 4.90 | 200 | HTML 5, English |
16893 | chess-results.com | 19685 | 4.90 | 200 | English, Transitional |
16894 | secure2.convio.net | 19686 | 4.90 | 200 | No Lang |
16895 | dailycamera.com | 19688 | 4.90 | 200 | HTML 5, English |
16896 | aston.ac.uk | 19689 | 4.90 | 200 | HTML 5, English |
16897 | transcribeme.com | 19690 | 4.90 | 200 | HTML 5, English |
16898 | indiancountrytoday.com | 19691 | 4.90 | 200 | HTML 5, English |
16899 | uned.es | 19692 | 4.90 | 200 | HTML 5 |
16900 | essd.copernicus.org | 19693 | 4.90 | 200 | English, Transitional |
Data from: Open PageRank