Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
15801 | fox19.com | 18417 | 4.91 | 200 | HTML 5, English |
15802 | humanrightsfirst.org | 18418 | 4.91 | 200 | HTML 5, English |
15803 | data.sfgov.org | 18419 | 4.91 | 200 | HTML 5, English |
15804 | wiwiss.fu-berlin.de | 18420 | 4.91 | 200 | HTML 5 |
15805 | simonandschuster.co.uk | 18421 | 4.91 | 200 | HTML 5, English |
15806 | teads.com | 18423 | 4.91 | 200 | HTML 5, English |
15807 | pinktentacle.com | 18424 | 4.91 | 200 | English, Strict |
15808 | teradata.com | 18425 | 4.91 | 200 | HTML 5, English |
15809 | onnit.com | 18426 | 4.91 | 200 | HTML 5, English |
15810 | thegazette.co.uk | 18427 | 4.91 | 200 | HTML 5, English |
15811 | soundbrenner.com | 18428 | 4.91 | 200 | HTML 5, English |
15812 | encyclopedia2.thefreedictionary.com | 18429 | 4.91 | 200 | HTML 5, No Lang |
15813 | marinsoftware.com | 18430 | 4.91 | 200 | HTML 5, English |
15814 | mondly.com | 18431 | 4.91 | 200 | HTML 5, English |
15815 | fox40.com | 18432 | 4.91 | 200 | HTML 5, English |
15816 | sahistory.org.za | 18433 | 4.91 | 200 | HTML 5, English |
15817 | humansystems.arc.nasa.gov | 18434 | 4.91 | 200 | No Lang, Transitional |
15818 | tinychat.com | 18435 | 4.91 | 200 | HTML 5, English |
15819 | erasmus-plus.ec.europa.eu | 18436 | 4.91 | 200 | HTML 5, English |
15820 | altera.com | 18437 | 4.91 | 200 | HTML 5, English |
15821 | cis-india.org | 18438 | 4.91 | 200 | English, Transitional |
15822 | fedoramagazine.org | 18439 | 4.91 | 200 | HTML 5, English |
15823 | watfordobserver.co.uk | 18440 | 4.91 | 200 | HTML 5, English |
15824 | montanasports.com | 18441 | 4.91 | 200 | HTML 5, English |
15825 | thenakedscientists.com | 18442 | 4.91 | 200 | HTML 5, English |
15826 | scholar.google.at | 18443 | 4.91 | 200 | HTML 5, No Lang |
15827 | internshala.com | 18444 | 4.91 | 200 | HTML 5, English |
15828 | classicalconversations.com | 18445 | 4.91 | 200 | HTML 5, English |
15829 | insightcrime.org | 18446 | 4.91 | 200 | HTML 5, English |
15830 | prestocard.ca | 18447 | 4.91 | 200 | HTML 5, English |
15831 | beincrypto.com | 18448 | 4.91 | 200 | HTML 5, English |
15832 | haas.berkeley.edu | 18450 | 4.91 | 200 | HTML 5, English |
15833 | barometern.se | 18451 | 4.91 | 200 | HTML 5 |
15834 | lwv.org | 18453 | 4.91 | 200 | HTML 5, English |
15835 | qualys.com | 18454 | 4.91 | 200 | HTML 5, English |
15836 | verbraucherzentrale.de | 18455 | 4.91 | 200 | HTML 5 |
15837 | vudu.com | 18456 | 4.91 | 200 | HTML 5, English |
15838 | tv-asahi.co.jp | 18457 | 4.91 | 200 | HTML 5 |
15839 | epilepsy.org.uk | 18458 | 4.91 | 200 | HTML 5, English |
15840 | thefabricator.com | 18459 | 4.91 | 200 | HTML 5, English |
15841 | somerset.qld.gov.au | 18460 | 4.91 | 200 | HTML 5, English |
15842 | eventhorizontelescope.org | 18461 | 4.91 | 200 | HTML 5, English |
15843 | crazyforcrust.com | 18462 | 4.91 | 200 | HTML 5, English |
15844 | watlow.com | 18463 | 4.91 | 200 | HTML 5, English |
15845 | entropymine.com | 18464 | 4.91 | 200 | No Lang |
15846 | search.sunbiz.org | 18465 | 4.91 | 200 | HTML 5, English |
15847 | planespotters.net | 18467 | 4.91 | 200 | HTML 5, English |
15848 | fireengineering.com | 18468 | 4.91 | 200 | HTML 5, English |
15849 | visitdetroit.com | 18469 | 4.91 | 200 | HTML 5, English |
15850 | web.math.princeton.edu | 18470 | 4.91 | 200 | HTML 5, English |
15851 | science.psu.edu | 18471 | 4.91 | 200 | HTML 5, English |
15852 | aperture.org | 18472 | 4.91 | 200 | HTML 5, English |
15853 | uamshealth.com | 18473 | 4.91 | 200 | HTML 5, English |
15854 | punto-informatico.it | 18474 | 4.91 | 200 | HTML 5, No Lang |
15855 | amanz.my | 18475 | 4.91 | 200 | HTML 5, English |
15856 | events.teams.microsoft.com | 18477 | 4.91 | 200 | HTML 5, English |
15857 | volkswagen-newsroom.com | 18479 | 4.91 | 200 | HTML 5, English |
15858 | lgbtqnation.com | 18480 | 4.91 | 200 | HTML 5, English |
15859 | usnews.nbcnews.com | 18481 | 4.91 | 200 | HTML 5, English |
15860 | sdna.gr | 18484 | 4.91 | 200 | HTML 5 |
15861 | events.humanitix.com | 18485 | 4.91 | 200 | HTML 5, English |
15862 | nlnet.nl | 18486 | 4.91 | 200 | HTML 5, English |
15863 | careerfoundry.com | 18487 | 4.91 | 200 | HTML 5, English |
15864 | informatica.com | 18488 | 4.91 | 200 | HTML 5, English |
15865 | docs.qq.com | 18489 | 4.91 | 200 | HTML 5, No Lang |
15866 | worldtimeserver.com | 18490 | 4.91 | 200 | HTML 5, English |
15867 | beaches.com | 18494 | 4.91 | 200 | HTML 5, English |
15868 | muz.li | 18495 | 4.91 | 200 | HTML 5, English |
15869 | econsumer.gov | 18496 | 4.91 | 200 | HTML 5, English |
15870 | daytranslations.com | 18497 | 4.91 | 200 | HTML 5, English |
15871 | cryptpad.fr | 18498 | 4.91 | 200 | No Lang |
15872 | mangazenkan.com | 18499 | 4.91 | 200 | HTML 5, No Lang |
15873 | grouplens.org | 18500 | 4.91 | 200 | HTML 5, English |
15874 | comtrade.un.org | 18501 | 4.91 | 200 | No Lang |
15875 | tobaccocontrol.bmj.com | 18502 | 4.91 | 200 | HTML 5, English |
15876 | roadsideamerica.com | 18503 | 4.91 | 200 | English, Strict |
15877 | mobitel.lk | 18504 | 4.91 | 200 | HTML 5, English |
15878 | capetownmagazine.com | 18506 | 4.91 | 200 | No Lang, Transitional |
15879 | minorityhealth.hhs.gov | 18507 | 4.91 | 200 | HTML 5, English |
15880 | worldipv6launch.org | 18508 | 4.91 | 200 | HTML 5, English |
15881 | dimsemenov.com | 18509 | 4.91 | 200 | HTML 5, No Lang |
15882 | gcore.com | 18510 | 4.91 | 200 | HTML 5, English |
15883 | disasterassistance.gov | 18512 | 4.91 | 200 | HTML 5, English |
15884 | 10news.com | 18513 | 4.91 | 200 | HTML 5, English |
15885 | harding.edu | 18514 | 4.91 | 200 | HTML 5, English |
15886 | isbnsearch.org | 18515 | 4.91 | 200 | HTML 5, English |
15887 | siberiantimes.com | 18516 | 4.91 | 200 | HTML 5, English |
15888 | ark-invest.com | 18518 | 4.91 | 200 | HTML 5, English |
15889 | sathyabama.ac.in | 18519 | 4.91 | 200 | HTML 5, English |
15890 | dataconomy.com | 18520 | 4.91 | 200 | HTML 5, English |
15891 | sennheiser.com | 18522 | 4.91 | 200 | HTML 5, English |
15892 | ncoa.org | 18523 | 4.91 | 200 | HTML 5, English |
15893 | atomicdesign.bradfrost.com | 18524 | 4.91 | 200 | HTML 5, No Lang |
15894 | electricimp.com | 18525 | 4.91 | 200 | HTML 5, English |
15895 | achievement.org | 18526 | 4.91 | 200 | HTML 5, English |
15896 | nomoreransom.org | 18527 | 4.91 | 200 | No Lang |
15897 | www3.imperial.ac.uk | 18528 | 4.91 | 200 | HTML 5, English |
15898 | newsquest.co.uk | 18529 | 4.91 | 200 | HTML 5, No Lang |
15899 | quicken.com | 18530 | 4.91 | 200 | HTML 5, English |
15900 | tcpdf.org | 18531 | 4.91 | 200 | HTML 5, English |
Data from: Open PageRank