Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
14301 | p2theme.com | 16672 | 4.94 | 200 | HTML 5, English |
14302 | tomdispatch.com | 16675 | 4.94 | 200 | HTML 5, English |
14303 | britishchambers.org.uk | 16676 | 4.94 | 200 | HTML 5, English |
14304 | theseoframework.com | 16677 | 4.94 | 200 | HTML 5, English |
14305 | uni-halle.de | 16678 | 4.94 | 200 | English, Transitional |
14306 | inforchannel.com.br | 16679 | 4.94 | 200 | HTML 5 |
14307 | pz-news.de | 16680 | 4.94 | 200 | HTML 5 |
14308 | shopping.yahoo.co.jp | 16681 | 4.94 | 200 | HTML 5 |
14309 | electronjs.org | 16682 | 4.94 | 200 | HTML 5, English |
14310 | cdu.de | 16683 | 4.94 | 200 | HTML 5 |
14311 | tcpdump.org | 16684 | 4.94 | 200 | HTML 5, English |
14312 | zh.wikisource.org | 16685 | 4.94 | 200 | HTML 5, No Lang |
14313 | chp.ca.gov | 16686 | 4.94 | 200 | English |
14314 | rpmfusion.org | 16687 | 4.94 | 200 | No Lang, Strict |
14315 | huffingtonpost.es | 16688 | 4.94 | 200 | HTML 5 |
14316 | giustizia.it | 16689 | 4.94 | 200 | HTML 5 |
14317 | rmit.edu.au | 16690 | 4.94 | 200 | HTML 5, English |
14318 | metro.ca | 16691 | 4.94 | 200 | HTML 5 |
14319 | explainthatstuff.com | 16692 | 4.94 | 200 | HTML 5, No Lang |
14320 | lookandlearn.com | 16693 | 4.94 | 200 | HTML 5, No Lang |
14321 | bdsmovement.net | 16694 | 4.94 | 200 | HTML 5, English |
14322 | ohio.com | 16695 | 4.94 | 200 | HTML 5, English |
14323 | ribbonfarm.com | 16696 | 4.94 | 200 | English, Transitional |
14324 | medialibrary.it | 16697 | 4.94 | 200 | HTML 5 |
14325 | lgtm.com | 16698 | 4.94 | 200 | HTML 5, English |
14326 | datacatalog.worldbank.org | 16699 | 4.94 | 200 | HTML 5, English |
14327 | nwo.nl | 16700 | 4.94 | 200 | HTML 5 |
14328 | laposta.nl | 16701 | 4.94 | 200 | HTML 5 |
14329 | berlinale.de | 16702 | 4.94 | 200 | HTML 5, English |
14330 | hpcwire.com | 16703 | 4.94 | 200 | HTML 5, English |
14331 | nationalcar.com | 16705 | 4.94 | 200 | HTML 5, English |
14332 | cda-adc.ca | 16708 | 4.94 | 200 | HTML 5, No Lang |
14333 | chinesestandard.net | 16709 | 4.94 | 200 | HTML 5, English |
14334 | aqicn.org | 16710 | 4.94 | 200 | English |
14335 | pods.io | 16713 | 4.94 | 200 | HTML 5, English |
14336 | dukascopy.com | 16714 | 4.94 | 200 | HTML 5, English |
14337 | fatherly.com | 16715 | 4.94 | 200 | HTML 5, English |
14338 | plasticsurgery.org | 16717 | 4.94 | 200 | HTML 5, English |
14339 | europeanpaymentscouncil.eu | 16718 | 4.94 | 200 | HTML 5, English |
14340 | glasspockets.org | 16719 | 4.94 | 200 | No Lang |
14341 | rtl.nl | 16720 | 4.94 | 200 | HTML 5 |
14342 | acma.gov.au | 16721 | 4.94 | 200 | HTML 5, English |
14343 | sci-hub.se | 16722 | 4.94 | 200 | HTML 5, English |
14344 | store.arduino.cc | 16723 | 4.94 | 200 | HTML 5, English |
14345 | rafaelnadal.com | 16724 | 4.94 | 200 | HTML 5 |
14346 | marylandhealthconnection.gov | 16725 | 4.94 | 200 | English |
14347 | data.overheid.nl | 16726 | 4.94 | 200 | HTML 5 |
14348 | gobiernodecanarias.org | 16727 | 4.94 | 200 | HTML 5 |
14349 | domino.com | 16728 | 4.94 | 200 | HTML 5, English |
14350 | furniturevillage.co.uk | 16729 | 4.94 | 200 | HTML 5, English |
14351 | radio24.ilsole24ore.com | 16730 | 4.94 | 200 | HTML 5 |
14352 | read.amazon.com | 16731 | 4.94 | 200 | HTML 5, English |
14353 | shonenjumpplus.com | 16732 | 4.94 | 200 | HTML 5 |
14354 | thehappyfoodie.co.uk | 16733 | 4.94 | 200 | No Lang |
14355 | cubiq.org | 16734 | 4.94 | 200 | No Lang |
14356 | foresight.org | 16735 | 4.94 | 200 | HTML 5, English |
14357 | broadway.com | 16736 | 4.94 | 200 | HTML 5, English |
14358 | ssrc.org | 16737 | 4.94 | 200 | HTML 5, English |
14359 | bbva.com | 16738 | 4.94 | 200 | HTML 5, English |
14360 | comic-walker.com | 16739 | 4.94 | 200 | HTML 5 |
14361 | digitalhumanities.org | 16740 | 4.94 | 200 | HTML 5, English |
14362 | dlapiper.com | 16741 | 4.94 | 200 | HTML 5, English |
14363 | afternic.com | 16742 | 4.94 | 200 | HTML 5, English |
14364 | vidio.com | 16743 | 4.94 | 200 | HTML 5, English |
14365 | choice.com.au | 16746 | 4.94 | 200 | HTML 5, English |
14366 | podnews.net | 16747 | 4.94 | 200 | HTML 5, English |
14367 | trekmovie.com | 16748 | 4.94 | 200 | HTML 5, English |
14368 | dnevnik.rs | 16749 | 4.94 | 200 | HTML 5 |
14369 | deutsche-digitale-bibliothek.de | 16750 | 4.94 | 200 | HTML 5 |
14370 | ilovefreesoftware.com | 16751 | 4.94 | 200 | HTML 5, English |
14371 | amazingfacts.org | 16752 | 4.94 | 200 | HTML 5, English |
14372 | algorithmwatch.org | 16753 | 4.94 | 200 | HTML 5, English |
14373 | thebodyshop.com | 16755 | 4.94 | 200 | HTML 5, English |
14374 | empa.ch | 16756 | 4.94 | 200 | HTML 5, English |
14375 | smart.com.ph | 16757 | 4.94 | 200 | HTML 5, No Lang |
14376 | shipspotting.com | 16758 | 4.94 | 200 | HTML 5, English |
14377 | imstat.org | 16759 | 4.94 | 200 | HTML 5, English |
14378 | picasa.google.com | 16761 | 4.94 | 200 | HTML 5, English |
14379 | aft.org | 16762 | 4.94 | 200 | HTML 5, English |
14380 | browser.geekbench.com | 16763 | 4.94 | 200 | HTML 5, English |
14381 | open-std.org | 16764 | 4.94 | 200 | No Lang |
14382 | public.oed.com | 16765 | 4.94 | 200 | HTML 5, English |
14383 | rvo.nl | 16766 | 4.94 | 200 | HTML 5 |
14384 | webappick.com | 16767 | 4.94 | 200 | HTML 5, English |
14385 | blogs.spectator.co.uk | 16768 | 4.94 | 200 | HTML 5, English |
14386 | falstad.com | 16769 | 4.94 | 200 | No Lang |
14387 | localsyr.com | 16770 | 4.94 | 200 | HTML 5, English |
14388 | magnumphotos.com | 16771 | 4.94 | 200 | HTML 5, English |
14389 | ashmolean.org | 16772 | 4.94 | 200 | HTML 5, English |
14390 | fcaugsburg.de | 16773 | 4.94 | 200 | HTML 5 |
14391 | theorg.com | 16774 | 4.94 | 200 | HTML 5, English |
14392 | sermonaudio.com | 16776 | 4.94 | 200 | HTML 5, English |
14393 | culturacolectiva.com | 16777 | 4.94 | 200 | HTML 5 |
14394 | portforward.com | 16778 | 4.94 | 200 | HTML 5, English |
14395 | thefw.com | 16779 | 4.94 | 200 | HTML 5, English |
14396 | us.burberry.com | 16780 | 4.94 | 200 | HTML 5, English |
14397 | ngv.vic.gov.au | 16781 | 4.94 | 200 | HTML 5, English |
14398 | itu.dk | 16782 | 4.94 | 200 | HTML 5, English |
14399 | travelsouthdakota.com | 16783 | 4.94 | 200 | HTML 5, English |
14400 | crooksandliars.com | 16784 | 4.94 | 200 | HTML 5, English |
Data from: Open PageRank