Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
13701 | vie-publique.fr | 15970 | 4.96 | 200 | No Lang |
13702 | almamedia.fi | 15971 | 4.96 | 200 | HTML 5 |
13703 | dados.gov.br | 15972 | 4.96 | 200 | HTML 5, No Lang |
13704 | fivestars.com | 15973 | 4.96 | 200 | HTML 5, English |
13705 | tweetdeck.twitter.com | 15974 | 4.96 | 200 | HTML 5, No Lang |
13706 | imaios.com | 15976 | 4.96 | 200 | HTML 5, English |
13707 | divisare.com | 15977 | 4.96 | 200 | HTML 5, English |
13708 | paymentsjournal.com | 15978 | 4.96 | 200 | HTML 5, English |
13709 | guides.loc.gov | 15979 | 4.96 | 200 | HTML 5, English |
13710 | rcrwireless.com | 15980 | 4.96 | 200 | English |
13711 | stad.gent | 15983 | 4.96 | 200 | HTML 5 |
13712 | agilebits.com | 15984 | 4.96 | 200 | HTML 5, English |
13713 | truthdig.com | 15985 | 4.96 | 200 | HTML 5, English |
13714 | schedulista.com | 15986 | 4.96 | 200 | HTML 5, English |
13715 | cxotoday.com | 15988 | 4.96 | 200 | HTML 5, English |
13716 | telkomsel.com | 15989 | 4.96 | 200 | HTML 5, English |
13717 | payumoney.com | 15990 | 4.96 | 200 | HTML 5, English |
13718 | conservancy.umn.edu | 15991 | 4.96 | 200 | HTML 5, English |
13719 | santatracker.google.com | 15992 | 4.96 | 200 | HTML 5, English |
13720 | amarujala.com | 15993 | 4.96 | 200 | HTML 5 |
13721 | vegas.com | 15994 | 4.96 | 200 | HTML 5, English |
13722 | lists.automattic.com | 15995 | 4.96 | 200 | No Lang |
13723 | coloradoan.com | 15996 | 4.96 | 200 | HTML 5, English |
13724 | ar.pinterest.com | 15997 | 4.96 | 200 | HTML 5, English |
13725 | stan.store | 15998 | 4.96 | 200 | HTML 5, No Lang |
13726 | motorcycleclassics.com | 15999 | 4.96 | 200 | HTML 5, English |
13727 | ua.usembassy.gov | 16000 | 4.96 | 200 | HTML 5, English |
13728 | howstuffworks.com | 16001 | 4.96 | 200 | HTML 5, English |
13729 | adwords.googleblog.com | 16002 | 4.96 | 200 | HTML 5, English |
13730 | mojeek.com | 16003 | 4.96 | 200 | HTML 5, English |
13731 | keywordtool.io | 16004 | 4.96 | 200 | HTML 5, English |
13732 | globaldelight.com | 16005 | 4.96 | 200 | HTML 5, English |
13733 | modernfarmer.com | 16006 | 4.96 | 200 | HTML 5, English |
13734 | blog.schema.org | 16008 | 4.96 | 200 | HTML 5, English |
13735 | issn.org | 16009 | 4.96 | 200 | HTML 5, No Lang |
13736 | rockefellerfoundation.org | 16010 | 4.96 | 200 | HTML 5, English |
13737 | dlsite.com | 16011 | 4.96 | 200 | HTML 5 |
13738 | theclio.com | 16013 | 4.96 | 200 | HTML 5, English |
13739 | animatedsoftware.com | 16014 | 4.96 | 200 | HTML 5, No Lang |
13740 | argaam.com | 16015 | 4.96 | 200 | HTML 5 |
13741 | microbewiki.kenyon.edu | 16016 | 4.96 | 200 | HTML 5, English |
13742 | topics.nintendo.co.jp | 16017 | 4.96 | 200 | HTML 5 |
13743 | bdtask.com | 16018 | 4.96 | 200 | HTML 5, English |
13744 | cybercivilrights.org | 16019 | 4.96 | 200 | HTML 5, English |
13745 | wgntv.com | 16022 | 4.96 | 200 | HTML 5, English |
13746 | travel.sygic.com | 16023 | 4.96 | 200 | HTML 5, No Lang |
13747 | jedec.org | 16024 | 4.96 | 200 | HTML 5, English |
13748 | wyborcza.pl | 16025 | 4.96 | 200 | HTML 5 |
13749 | fr.le360.ma | 16026 | 4.96 | 200 | HTML 5 |
13750 | oceanexplorer.noaa.gov | 16027 | 4.96 | 200 | HTML 5, English |
13751 | asiatoday.co.kr | 16028 | 4.96 | 200 | HTML 5 |
13752 | halleonard.com | 16029 | 4.96 | 200 | HTML 5, English |
13753 | orthobullets.com | 16030 | 4.96 | 200 | HTML 5, English |
13754 | content.lib.washington.edu | 16032 | 4.96 | 200 | No Lang, Transitional |
13755 | getsession.org | 16033 | 4.96 | 200 | HTML 5, English |
13756 | ai.glossika.com | 16034 | 4.96 | 200 | HTML 5, English |
13757 | righto.com | 16035 | 4.96 | 200 | HTML 5, No Lang |
13758 | fashionmodeldirectory.com | 16036 | 4.96 | 200 | HTML 5, No Lang |
13759 | openmrs.org | 16037 | 4.96 | 200 | HTML 5, English |
13760 | funnyjunk.com | 16038 | 4.96 | 200 | HTML 5, English |
13761 | scn.sap.com | 16039 | 4.96 | 200 | HTML 5, English |
13762 | azattyk.org | 16040 | 4.96 | 200 | HTML 5 |
13763 | oshwa.org | 16041 | 4.96 | 200 | HTML 5, English |
13764 | uni-augsburg.de | 16042 | 4.96 | 200 | HTML 5, English |
13765 | journalofaccountancy.com | 16043 | 4.96 | 200 | HTML 5, English |
13766 | istoe.com.br | 16044 | 4.96 | 200 | HTML 5 |
13767 | lists.webkit.org | 16045 | 4.96 | 200 | No Lang |
13768 | 5by5.tv | 16046 | 4.96 | 200 | HTML 5, English |
13769 | colorhexa.com | 16047 | 4.96 | 200 | HTML 5, English |
13770 | astound.com | 16048 | 4.96 | 200 | HTML 5, English |
13771 | greaterkashmir.com | 16049 | 4.96 | 200 | HTML 5, English |
13772 | lionsroar.com | 16050 | 4.96 | 200 | HTML 5, English |
13773 | gnome-look.org | 16051 | 4.96 | 200 | HTML 5, English |
13774 | starwoodhotels.com | 16052 | 4.96 | 200 | HTML 5, English |
13775 | witness.org | 16053 | 4.96 | 200 | HTML 5, English |
13776 | goto.com | 16054 | 4.96 | 200 | HTML 5, English |
13777 | asme.org | 16056 | 4.96 | 200 | HTML 5, English |
13778 | oapen.org | 16058 | 4.96 | 200 | HTML 5, English |
13779 | itcilo.org | 16059 | 4.96 | 200 | HTML 5, English |
13780 | databreaches.net | 16060 | 4.96 | 200 | HTML 5, English |
13781 | logmi.jp | 16061 | 4.96 | 200 | HTML 5 |
13782 | ica.se | 16063 | 4.96 | 200 | HTML 5 |
13783 | ncat.edu | 16064 | 4.96 | 200 | HTML 5, English |
13784 | helgeklein.com | 16065 | 4.96 | 200 | HTML 5, English |
13785 | ashoka.org | 16067 | 4.96 | 200 | HTML 5, English |
13786 | seaworld.com | 16068 | 4.96 | 200 | HTML 5, English |
13787 | ihl-databases.icrc.org | 16069 | 4.96 | 200 | HTML 5, English |
13788 | pobox.com | 16070 | 4.96 | 200 | HTML 5, English |
13789 | dlib.nyu.edu | 16071 | 4.96 | 200 | HTML 5, English |
13790 | unive.it | 16072 | 4.96 | 200 | HTML 5 |
13791 | annualreports.com | 16073 | 4.96 | 200 | HTML 5, English |
13792 | bas.ac.uk | 16074 | 4.96 | 200 | HTML 5, English |
13793 | fox13news.com | 16075 | 4.96 | 200 | HTML 5, English |
13794 | mopria.org | 16076 | 4.96 | 200 | HTML 5, English |
13795 | library.harvard.edu | 16077 | 4.96 | 200 | HTML 5, English |
13796 | halfbrick.com | 16078 | 4.96 | 200 | HTML 5, English |
13797 | pubmatic.com | 16079 | 4.96 | 200 | HTML 5, English |
13798 | cordcuttersnews.com | 16080 | 4.96 | 200 | HTML 5, English |
13799 | bloomthis.co | 16082 | 4.96 | 200 | HTML 5, English |
13800 | src.chromium.org | 16083 | 4.96 | 200 | HTML 5, No Lang |
Data from: Open PageRank