Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
13501 | opcw.org | 15730 | 4.96 | 200 | HTML 5, English |
13502 | wicg.io | 15731 | 4.96 | 200 | HTML 5, No Lang |
13503 | dxomark.com | 15732 | 4.96 | 200 | HTML 5, English |
13504 | webofstories.com | 15733 | 4.96 | 200 | English, Transitional |
13505 | bedford.gov.uk | 15734 | 4.96 | 200 | HTML 5, English |
13506 | presidentialinnovationfellows.gov | 15736 | 4.96 | 200 | HTML 5, English |
13507 | yaleclimateconnections.org | 15737 | 4.96 | 200 | HTML 5, English |
13508 | netlib.org | 15738 | 4.96 | 200 | No Lang |
13509 | homechef.com | 15740 | 4.96 | 200 | HTML 5, English |
13510 | fiba.basketball | 15741 | 4.96 | 200 | HTML 5, English |
13511 | tanzu.vmware.com | 15742 | 4.96 | 200 | HTML 5, English |
13512 | hot.ee | 15743 | 4.96 | 200 | HTML 5, English |
13513 | leagle.com | 15744 | 4.96 | 200 | HTML 5, English |
13514 | computerbild.de | 15745 | 4.96 | 200 | HTML 5 |
13515 | transcripts.cnn.com | 15747 | 4.96 | 200 | HTML 5, English |
13516 | cityofpasadena.net | 15748 | 4.96 | 200 | HTML 5, English |
13517 | geneseo.edu | 15749 | 4.96 | 200 | HTML 5, English |
13518 | emailselfdefense.fsf.org | 15750 | 4.96 | 200 | HTML 5, English |
13519 | volunteermatch.org | 15751 | 4.96 | 200 | HTML 5, English |
13520 | en.yna.co.kr | 15752 | 4.96 | 200 | HTML 5, English |
13521 | websitecarbon.com | 15754 | 4.96 | 200 | HTML 5, English |
13522 | outsports.com | 15755 | 4.96 | 200 | HTML 5, English |
13523 | jorudan.co.jp | 15756 | 4.96 | 200 | HTML 5 |
13524 | labsmobile.com | 15757 | 4.96 | 200 | HTML 5, English |
13525 | avro.apache.org | 15758 | 4.96 | 200 | HTML 5, English |
13526 | jsonapi.org | 15759 | 4.96 | 200 | HTML 5, No Lang |
13527 | pocketgamer.biz | 15760 | 4.96 | 200 | HTML 5, English |
13528 | aruba.com | 15761 | 4.96 | 200 | HTML 5, English |
13529 | gate.io | 15762 | 4.96 | 200 | HTML 5, English |
13530 | ebooks.iospress.nl | 15764 | 4.96 | 200 | HTML 5, No Lang |
13531 | phac-aspc.gc.ca | 15765 | 4.96 | 200 | HTML 5, English |
13532 | kr.pinterest.com | 15766 | 4.96 | 200 | HTML 5, English |
13533 | alumni.media.mit.edu | 15767 | 4.96 | 200 | No Lang |
13534 | hosted.ap.org | 15768 | 4.96 | 200 | HTML 5, English |
13535 | skyandtelescope.com | 15769 | 4.96 | 200 | HTML 5, English |
13536 | astro.com.my | 15770 | 4.96 | 200 | HTML 5, English |
13537 | polytechnique.edu | 15771 | 4.96 | 200 | HTML 5 |
13538 | docs.spring.io | 15772 | 4.96 | 200 | HTML 5, English |
13539 | ekathimerini.com | 15773 | 4.96 | 200 | HTML 5, English |
13540 | cs.tut.fi | 15775 | 4.96 | 200 | HTML 5 |
13541 | simpletexting.com | 15776 | 4.96 | 200 | HTML 5, English |
13542 | fauna.com | 15777 | 4.96 | 200 | HTML 5, English |
13543 | kcet.org | 15779 | 4.96 | 200 | HTML 5, English |
13544 | map.yahoo.co.jp | 15782 | 4.96 | 200 | HTML 5 |
13545 | newsru.co.il | 15783 | 4.96 | 200 | |
13546 | librarian.net | 15785 | 4.96 | 200 | HTML 5, English |
13547 | pagina12.com.ar | 15786 | 4.96 | 200 | HTML 5 |
13548 | lamoncloa.gob.es | 15787 | 4.96 | 200 | HTML 5, English |
13549 | orchidspecies.com | 15788 | 4.96 | 200 | No Lang |
13550 | libera.chat | 15789 | 4.96 | 200 | HTML 5, English |
13551 | about.flipboard.com | 15790 | 4.96 | 200 | HTML 5, English |
13552 | homedit.com | 15791 | 4.96 | 200 | HTML 5, English |
13553 | hautehijab.com | 15792 | 4.96 | 200 | HTML 5, English |
13554 | thefoundry.co.uk | 15793 | 4.96 | 200 | HTML 5, English |
13555 | hsbc.com.hk | 15794 | 4.96 | 200 | HTML 5, English |
13556 | archive-it.org | 15795 | 4.96 | 200 | HTML 5, English |
13557 | dot.kde.org | 15797 | 4.96 | 200 | HTML 5, English |
13558 | chevron.com | 15798 | 4.96 | 200 | HTML 5, English |
13559 | mars.com | 15799 | 4.96 | 200 | HTML 5, English |
13560 | lists.freedesktop.org | 15800 | 4.96 | 200 | No Lang |
13561 | democrats.org | 15801 | 4.96 | 200 | HTML 5, English |
13562 | metro.us | 15802 | 4.96 | 200 | HTML 5, English |
13563 | aenetworks.com | 15804 | 4.96 | 200 | HTML 5, English |
13564 | swansea.ac.uk | 15807 | 4.96 | 200 | HTML 5, English |
13565 | netpbm.sourceforge.net | 15808 | 4.96 | 200 | No Lang |
13566 | shortyawards.com | 15809 | 4.96 | 200 | HTML 5, No Lang |
13567 | userpages.umbc.edu | 15810 | 4.96 | 200 | No Lang |
13568 | jta.org | 15811 | 4.96 | 200 | HTML 5, English |
13569 | autosar.org | 15812 | 4.96 | 200 | HTML 5, English |
13570 | tg4.ie | 15813 | 4.96 | 200 | HTML 5 |
13571 | mydramalist.com | 15814 | 4.96 | 200 | HTML 5, English |
13572 | lumberjocks.com | 15815 | 4.96 | 200 | HTML 5, English |
13573 | cegid.com | 15816 | 4.96 | 200 | HTML 5, English |
13574 | jewishcurrents.org | 15817 | 4.96 | 200 | HTML 5, English |
13575 | addicted2success.com | 15818 | 4.96 | 200 | HTML 5, English |
13576 | forums2.gardenweb.com | 15819 | 4.96 | 200 | HTML 5, English |
13577 | ajmc.com | 15820 | 4.96 | 200 | HTML 5, English |
13578 | codeforces.com | 15821 | 4.96 | 200 | English |
13579 | www1.ncdc.noaa.gov | 15822 | 4.96 | 200 | No Lang |
13580 | dccomics.com | 15824 | 4.96 | 200 | HTML 5, English |
13581 | wacken.com | 15826 | 4.96 | 200 | HTML 5, No Lang |
13582 | architectuur.nl | 15827 | 4.96 | 200 | HTML 5, No Lang |
13583 | gearpatrol.com | 15829 | 4.96 | 200 | HTML 5, English |
13584 | lists.debian.org | 15830 | 4.96 | 200 | English, Strict |
13585 | produto.mercadolivre.com.br | 15831 | 4.96 | 200 | HTML 5 |
13586 | wvgazettemail.com | 15832 | 4.96 | 200 | HTML 5, English |
13587 | radiodisneyclub.fr | 15833 | 4.96 | 200 | |
13588 | shine.cn | 15834 | 4.96 | 200 | English |
13589 | fck.de | 15836 | 4.96 | 200 | HTML 5 |
13590 | unito.it | 15838 | 4.96 | 200 | |
13591 | tomorrowland.com | 15839 | 4.96 | 200 | HTML 5, No Lang |
13592 | imperialviolet.org | 15840 | 4.96 | 200 | HTML 5, English |
13593 | skybound.com | 15841 | 4.96 | 200 | HTML 5, English |
13594 | condor.com | 15842 | 4.96 | 200 | HTML 5, English |
13595 | yourls.org | 15844 | 4.96 | 200 | HTML 5, English |
13596 | developers.braintreepayments.com | 15845 | 4.96 | 200 | HTML 5, English |
13597 | m-w.com | 15846 | 4.96 | 200 | HTML 5, English |
13598 | panynj.gov | 15847 | 4.96 | 200 | HTML 5, English |
13599 | wikivoyage.org | 15848 | 4.96 | 200 | HTML 5, English |
13600 | acumbamail.com | 15849 | 4.96 | 200 | HTML 5 |
Data from: Open PageRank