Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than by traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and is not in the list!
- It doesn't get much traffic, but ranked below 10M+??
- Good performance test of a database table with 10M rows
- Cold loads take several seconds when the data is paged out
- Well indexed, but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks, clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix or park to handle redirects.
- That is lost revenue, given how much linking it takes to get on this list
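
The naked-domain observations above can be reproduced with a small probe. This is a minimal sketch using only the Python standard library, not the crawler used here. The names `candidate_hosts` and `probe_https` and the outcome labels are made up for illustration. It tries the naked domain and the www form with normal certificate verification and reports whether the client got a response, a certificate error, or no DNS entry.

```python
import http.client
import socket
import ssl


def candidate_hosts(domain: str) -> list[str]:
    """Return the naked domain and its www form, in that order."""
    domain = domain.strip().lower().rstrip(".")
    if domain.startswith("www."):
        return [domain[4:], domain]
    return [domain, "www." + domain]


def probe_https(host: str, timeout: float = 5.0) -> dict:
    """Send one HTTPS HEAD request with standard certificate checks (hypothetical helper)."""
    context = ssl.create_default_context()  # verifies the chain and the host name
    conn = http.client.HTTPSConnection(host, timeout=timeout, context=context)
    try:
        conn.request("HEAD", "/")
        response = conn.getresponse()
        return {
            "host": host,
            "outcome": "ok",
            "status": response.status,
            # Present on 3xx responses; this is the redirect a strict client never
            # sees when the certificate on the naked domain is invalid.
            "location": response.getheader("Location"),
        }
    except ssl.SSLCertVerificationError as exc:
        return {"host": host, "outcome": "bad_certificate", "detail": exc.verify_message}
    except ssl.SSLError as exc:
        return {"host": host, "outcome": "tls_error", "detail": str(exc)}
    except socket.gaierror:
        return {"host": host, "outcome": "no_dns"}
    except (OSError, http.client.HTTPException) as exc:
        return {"host": host, "outcome": "unreachable", "detail": str(exc)}
    finally:
        conn.close()


def probe_domain(domain: str) -> list[dict]:
    """Probe both forms of a domain so the results can be compared side by side."""
    return [probe_https(host) for host in candidate_hosts(domain)]
```

Calling `probe_domain("example.com")` returns one result per host. A `bad_certificate` on the naked form next to an `ok` on the www form is the pattern described in the list above.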
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16201 | digit.fyi | 18882 | 4.91 | 200 | HTML 5, No Lang |
16202 | creative.adobe.com | 18883 | 4.91 | 200 | HTML 5, English |
16203 | books.wwnorton.com | 18885 | 4.91 | 200 | HTML 5, English |
16204 | tracxn.com | 18886 | 4.91 | 200 | HTML 5, English |
16205 | retsd.mb.ca | 18887 | 4.91 | 200 | HTML 5, English |
16206 | distill.pub | 18888 | 4.91 | 200 | HTML 5, English |
16207 | picclick.com | 18889 | 4.91 | 200 | HTML 5, English |
16208 | powerlanguage.co.uk | 18892 | 4.91 | 200 | HTML 5, No Lang |
16209 | lapor.go.id | 18893 | 4.91 | 200 | HTML 5, No Lang |
16210 | bora.uib.no | 18894 | 4.91 | 200 | HTML 5, English |
16211 | gainesville.com | 18895 | 4.91 | 200 | HTML 5, English |
16212 | knoe.com | 18896 | 4.91 | 200 | HTML 5, English |
16213 | leverageedu.com | 18897 | 4.91 | 200 | HTML 5, English |
16214 | larvalabs.com | 18898 | 4.91 | 200 | HTML 5, English |
16215 | muelheim-ruhr.de | 18899 | 4.91 | 200 | Transitional |
16216 | ircam.fr | 18900 | 4.91 | 200 | HTML 5, English |
16217 | ruor.uottawa.ca | 18901 | 4.91 | 200 | HTML 5, English |
16218 | institute.global | 18904 | 4.91 | 200 | HTML 5, English |
16219 | fhs.swiss | 18905 | 4.91 | 200 | HTML 5, No Lang |
16220 | dosbox.com | 18906 | 4.91 | 200 | No Lang, Transitional |
16221 | mitmproxy.org | 18907 | 4.91 | 200 | HTML 5, English |
16222 | businessbecause.com | 18908 | 4.91 | 200 | HTML 5, No Lang |
16223 | researchsquare.com | 18909 | 4.91 | 200 | HTML 5, English |
16224 | usa.ipums.org | 18910 | 4.91 | 200 | HTML 5, English |
16225 | blibli.com | 18912 | 4.91 | 200 | |
16226 | physionet.org | 18913 | 4.91 | 200 | HTML 5, English |
16227 | plan-international.org | 18914 | 4.91 | 200 | HTML 5, English |
16228 | portal.ogc.org | 18916 | 4.91 | 200 | No Lang |
16229 | plannthat.com | 18918 | 4.91 | 200 | HTML 5, English |
16230 | hasselblad.com | 18919 | 4.91 | 200 | HTML 5, English |
16231 | warszawa.wyborcza.pl | 18920 | 4.91 | 200 | HTML 5 |
16232 | metrotimes.com | 18921 | 4.91 | 200 | HTML 5, English |
16233 | inventables.com | 18923 | 4.91 | 200 | HTML 5, English |
16234 | tvspb.ru | 18924 | 4.91 | 200 | HTML 5, No Lang |
16235 | nbc12.com | 18925 | 4.91 | 200 | HTML 5, English |
16236 | techworm.net | 18927 | 4.91 | 200 | English |
16237 | blog.mailchimp.com | 18928 | 4.91 | 200 | HTML 5, English |
16238 | ses.library.usyd.edu.au | 18929 | 4.91 | 200 | HTML 5, English |
16239 | legal.un.org | 18931 | 4.91 | 200 | No Lang, Transitional |
16240 | mbie.govt.nz | 18932 | 4.91 | 200 | HTML 5, English |
16241 | dataprot.net | 18933 | 4.91 | 200 | HTML 5, English |
16242 | wpshout.com | 18934 | 4.91 | 200 | HTML 5, English |
16243 | naturalearthdata.com | 18935 | 4.91 | 200 | English, Transitional |
16244 | natlib.govt.nz | 18936 | 4.91 | 200 | No Lang |
16245 | commanders.com | 18937 | 4.91 | 200 | HTML 5, English |
16246 | cullmantimes.com | 18938 | 4.91 | 200 | HTML 5, English |
16247 | about.nike.com | 18939 | 4.91 | 200 | HTML 5, English |
16248 | wciom.ru | 18940 | 4.91 | 200 | HTML 5 |
16249 | spacy.io | 18941 | 4.91 | 200 | HTML 5, English |
16250 | learn.genetics.utah.edu | 18942 | 4.91 | 200 | HTML 5, English |
16251 | redcross.sg | 18944 | 4.91 | 200 | No Lang |
16252 | theanarchistlibrary.org | 18945 | 4.91 | 200 | HTML 5, English |
16253 | toneden.io | 18946 | 4.91 | 200 | No Lang |
16254 | theedgemalaysia.com | 18947 | 4.91 | 200 | HTML 5, No Lang |
16255 | community.cookiepro.com | 18949 | 4.91 | 200 | No Lang, Transitional |
16256 | ninjakiwi.com | 18950 | 4.91 | 200 | HTML 5, No Lang |
16257 | parl.ca | 18951 | 4.91 | 200 | English |
16258 | fosstodon.org | 18953 | 4.91 | 200 | HTML 5, English |
16259 | boyslife.org | 18954 | 4.91 | 200 | HTML 5, English |
16260 | interactivebrokers.com | 18955 | 4.91 | 200 | HTML 5, English |
16261 | placementindia.com | 18956 | 4.91 | 200 | HTML 5, English |
16262 | luxyhair.com | 18957 | 4.91 | 200 | HTML 5, English |
16263 | resilience.org | 18958 | 4.91 | 200 | HTML 5, English |
16264 | blogs.office.com | 18959 | 4.91 | 200 | HTML 5, English |
16265 | inforum.com | 18960 | 4.91 | 200 | HTML 5, English |
16266 | music-encoding.org | 18961 | 4.91 | 200 | HTML 5, English |
16267 | wvculture.org | 18962 | 4.91 | 200 | HTML 5, English |
16268 | conferences.oreillynet.com | 18963 | 4.91 | 200 | HTML 5, English |
16269 | nav.no | 18964 | 4.91 | 200 | HTML 5 |
16270 | uspsoig.gov | 18965 | 4.91 | 200 | HTML 5, English |
16271 | wiki.linuxfoundation.org | 18966 | 4.91 | 200 | HTML 5, English |
16272 | pixnet.net | 18967 | 4.91 | 200 | HTML 5 |
16273 | practicalselfreliance.com | 18968 | 4.91 | 200 | HTML 5, English |
16274 | feedburner.com | 18969 | 4.91 | 200 | HTML 5, English |
16275 | eclecticlight.co | 18971 | 4.91 | 200 | HTML 5, English |
16276 | star-m.jp | 18972 | 4.91 | 200 | HTML 5 |
16277 | lazard.com | 18973 | 4.91 | 200 | HTML 5, English |
16278 | garant.ru | 18974 | 4.91 | 200 | HTML 5, No Lang |
16279 | paulbourke.net | 18976 | 4.91 | 200 | English |
16280 | indochino.com | 18977 | 4.91 | 200 | HTML 5, English |
16281 | chillicothegazette.com | 18978 | 4.91 | 200 | HTML 5, English |
16282 | nedbatchelder.com | 18979 | 4.91 | 200 | HTML 5, English |
16283 | ayatemplates.com | 18980 | 4.91 | 200 | HTML 5, English |
16284 | atlas.cid.harvard.edu | 18981 | 4.91 | 200 | HTML 5, English |
16285 | trustbank.co.jp | 18983 | 4.91 | 200 | HTML 5 |
16286 | law.ucla.edu | 18984 | 4.91 | 200 | HTML 5, English |
16287 | weezevent.com | 18985 | 4.91 | 200 | HTML 5, English |
16288 | organicthemes.com | 18986 | 4.91 | 200 | HTML 5, English |
16289 | modeltheme.com | 18987 | 4.91 | 200 | HTML 5, English |
16290 | library.uniteddiversity.coop | 18988 | 4.91 | 200 | HTML 5, English |
16291 | bitcoincash.org | 18989 | 4.91 | 200 | HTML 5, English |
16292 | eventbrite.sg | 18990 | 4.91 | 200 | HTML 5, No Lang |
16293 | jsonpatch.com | 18991 | 4.91 | 200 | HTML 5, English |
16294 | manba.co.jp | 18992 | 4.91 | 200 | HTML 5 |
16295 | science.co.il | 18993 | 4.91 | 200 | HTML 5, English |
16296 | wpcampus.org | 18994 | 4.91 | 200 | HTML 5, English |
16297 | bestmovie.it | 18995 | 4.91 | 200 | HTML 5 |
16298 | token2.com | 18997 | 4.91 | 200 | HTML 5, English |
16299 | plotek.pl | 18998 | 4.91 | 200 | HTML 5 |
16300 | linguistlist.org | 18999 | 4.91 | 200 | HTML 5, English |
Data from: Open PageRank
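
As a rough illustration of how values like those in the Flags column could be derived from a fetched home page, here is a minimal Python sketch. The flag names are borrowed from the table, but the rules are assumptions, not the logic actually used to build it: an exact `<!DOCTYPE html>` counts as "HTML 5", a doctype mentioning "Transitional" counts as "Transitional", a `lang` attribute on `<html>` starting with `en` counts as "English", and a missing `lang` counts as "No Lang". The function name `classify` is hypothetical.

```python
import re

# Assumed patterns: a regex pass is enough for a sketch, but a real crawler
# would probably want a proper HTML parser.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# The lookbehind skips xml:lang and data-lang so only a plain lang attribute matches.
LANG_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z-]+)""", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return flag strings for one page, using the assumed rules described above."""
    flags = []

    doctype = DOCTYPE_RE.search(html)
    declaration = doctype.group(1).strip().lower() if doctype else ""
    if declaration == "html":
        flags.append("HTML 5")

    tag = HTML_TAG_RE.search(html)
    lang = LANG_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().startswith("en"):
        flags.append("English")
    # Any other language gets no language flag in this sketch.

    if "transitional" in declaration:
        flags.append("Transitional")
    return flags


if __name__ == "__main__":
    print(classify('<!DOCTYPE html><html lang="en-US"><head></head></html>'))
```

Under these rules a page with a valid doctype and a non-English `lang` comes out as just "HTML 5", and a page with no recognized doctype and no flagged language yields an empty list.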