Table filled in from a list of the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than by traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but ranked below 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into Postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains in the list are naked domains, not fully qualified hostnames.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com: one should redirect to the other.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks, clients will never get the redirect
- These are domains that redirect from behind an invalid SSL certificate
- This is super easy to fix, or to park the domain so it handles redirects.
- That is lost revenue for sites with enough inbound links to get on this list
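
The naked-domain observations above suggest that a crawler should try more than one URL per domain while leaving certificate verification switched on. Here is a minimal sketch of that idea using Python's standard `urllib`. The names `candidate_urls` and `first_reachable` are hypothetical, and the order in which URLs are tried is an assumption; this is not the crawler used for the report.

```python
import urllib.request


def candidate_urls(domain: str) -> list[str]:
    """Return URLs to try for a domain: naked and www hosts, HTTPS first."""
    hosts = [domain]
    if not domain.startswith("www."):
        hosts.append("www." + domain)
    return [f"{scheme}://{host}/" for scheme in ("https", "http") for host in hosts]


def first_reachable(domain: str, opener=urllib.request.urlopen, timeout: float = 10):
    """Try each candidate URL with default (verifying) SSL settings.

    Returns (final_url, status, errors). `errors` maps each failed URL to
    the exception seen, so DNS and certificate failures stay visible.
    `opener` is injectable so the logic can be exercised without a network.
    """
    errors: dict[str, str] = {}
    for url in candidate_urls(domain):
        try:
            with opener(url, timeout=timeout) as resp:
                # geturl() reflects any redirect that was followed.
                return resp.geturl(), resp.status, errors
        except OSError as exc:
            # URLError, HTTPError, SSL and DNS failures all derive from OSError.
            errors[url] = repr(exc)
    return None, None, errors
```

Keeping the per-URL errors, rather than only the first success, makes it possible to count how many naked domains fail on DNS or on the certificate even when the www host works.
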
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
9801 | singularityhub.com | 11424 | 5.05 | 200 | HTML 5, English |
9802 | vogue.es | 11425 | 5.05 | 200 | HTML 5 |
9803 | answerforce.com | 11426 | 5.05 | 200 | HTML 5, English |
9804 | business.qld.gov.au | 11427 | 5.05 | 200 | HTML 5, English |
9805 | folha.com.br | 11428 | 5.05 | 200 | HTML 5 |
9806 | meritalk.com | 11429 | 5.05 | 200 | HTML 5, English |
9807 | helpdesk.com | 11430 | 5.05 | 200 | HTML 5, English |
9808 | turkishairlines.com | 11431 | 5.05 | 200 | HTML 5, English |
9809 | fentybeauty.com | 11432 | 5.05 | 200 | HTML 5, English |
9810 | publib.boulder.ibm.com | 11433 | 5.05 | 200 | No Lang |
9811 | aia.org | 11436 | 5.05 | 200 | HTML 5, English |
9812 | dshs.texas.gov | 11437 | 5.05 | 200 | HTML 5, English |
9813 | tgifridays.com | 11438 | 5.05 | 200 | HTML 5, English |
9814 | gesis.org | 11439 | 5.05 | 200 | |
9815 | glossy.co | 11440 | 5.05 | 200 | HTML 5, English |
9816 | choosemyplate.gov | 11441 | 5.05 | 200 | HTML 5, English |
9817 | tmall.com | 11442 | 5.05 | 200 | HTML 5 |
9818 | nbcsandiego.com | 11443 | 5.05 | 200 | HTML 5, English |
9819 | blackpast.org | 11444 | 5.05 | 200 | HTML 5, English |
9820 | axisbank.com | 11445 | 5.05 | 200 | HTML 5, English |
9821 | tutiempo.net | 11446 | 5.05 | 200 | HTML 5 |
9822 | vogue.co.jp | 11448 | 5.05 | 200 | HTML 5 |
9823 | njt.hu | 11449 | 5.05 | 200 | HTML 5 |
9824 | ccrma.stanford.edu | 11451 | 5.05 | 200 | English, Strict |
9825 | inria.hal.science | 11452 | 5.05 | 200 | HTML 5, English |
9826 | music.163.com | 11453 | 5.05 | 200 | HTML 5, No Lang |
9827 | touchofmodern.com | 11454 | 5.05 | 200 | HTML 5, English |
9828 | phrase.com | 11455 | 5.05 | 200 | HTML 5, English |
9829 | nbc.ca | 11456 | 5.05 | 200 | HTML 5, English |
9830 | apps.devilhunter.net | 11457 | 5.05 | 200 | HTML 5, English |
9831 | hinge.co | 11458 | 5.05 | 200 | HTML 5, English |
9832 | heritagedaily.com | 11459 | 5.05 | 200 | HTML 5, English |
9833 | base64decode.org | 11460 | 5.05 | 200 | HTML 5, English |
9834 | capitalfactory.com | 11461 | 5.05 | 200 | HTML 5, English |
9835 | blog.research.google | 11462 | 5.05 | 200 | HTML 5, English |
9836 | windowsreport.com | 11463 | 5.05 | 200 | HTML 5, English |
9837 | translatewiki.net | 11464 | 5.05 | 200 | HTML 5, No Lang |
9838 | openlab.citytech.cuny.edu | 11465 | 5.05 | 200 | English, Transitional |
9839 | mo.gov | 11466 | 5.05 | 200 | HTML 5, English |
9840 | getjobber.com | 11467 | 5.05 | 200 | HTML 5, English |
9841 | forbes.com.mx | 11469 | 5.05 | 200 | HTML 5 |
9842 | docs.mongodb.com | 11470 | 5.05 | 200 | HTML 5, English |
9843 | sideshow.com | 11471 | 5.05 | 200 | HTML 5, English |
9844 | mentor.com | 11472 | 5.05 | 200 | HTML 5, English |
9845 | bulma.io | 11473 | 5.05 | 200 | HTML 5, English |
9846 | ladygaga.com | 11475 | 5.05 | 200 | HTML 5, English |
9847 | open.alberta.ca | 11476 | 5.05 | 200 | HTML 5, English |
9848 | symfony.com | 11477 | 5.05 | 200 | HTML 5, English |
9849 | nominatim.openstreetmap.org | 11479 | 5.05 | 200 | HTML 5, English |
9850 | slideslive.com | 11480 | 5.05 | 200 | HTML 5, English |
9851 | switch.ch | 11481 | 5.05 | 200 | HTML 5, No Lang |
9852 | occ.gov | 11482 | 5.05 | 200 | HTML 5, English |
9853 | estsecurity.com | 11483 | 5.05 | 200 | HTML 5, No Lang |
9854 | personal.lse.ac.uk | 11487 | 5.05 | 200 | No Lang |
9855 | toom.de | 11488 | 5.05 | 200 | HTML 5 |
9856 | noguchi.org | 11489 | 5.05 | 200 | HTML 5, English |
9857 | abstractsonline.com | 11490 | 5.05 | 200 | HTML 5, English |
9858 | cs.ucr.edu | 11492 | 5.05 | 200 | HTML 5, English |
9859 | beta.character.ai | 11493 | 5.05 | 200 | HTML 5, English |
9860 | snowflake.com | 11494 | 5.05 | 200 | HTML 5, English |
9861 | id.loc.gov | 11496 | 5.05 | 200 | No Lang |
9862 | nearbynow.co | 11497 | 5.05 | 200 | HTML 5, No Lang |
9863 | flytap.com | 11498 | 5.05 | 200 | HTML 5, English |
9864 | capitalfm.com | 11499 | 5.05 | 200 | HTML 5, English |
9865 | ibtimes.co.in | 11502 | 5.05 | 200 | HTML 5, English |
9866 | landsat.usgs.gov | 11503 | 5.05 | 200 | HTML 5, English |
9867 | sport1.de | 11504 | 5.05 | 200 | HTML 5 |
9868 | sites.research.google | 11505 | 5.05 | 200 | HTML 5, English |
9869 | mobot.org | 11506 | 5.05 | 200 | HTML 5, English |
9870 | ben.balter.com | 11507 | 5.05 | 200 | HTML 5, English |
9871 | inbound.com | 11508 | 5.05 | 200 | HTML 5, English |
9872 | news.northeastern.edu | 11510 | 5.05 | 200 | HTML 5, English |
9873 | yes24.com | 11512 | 5.05 | 200 | HTML 5 |
9874 | usaa.com | 11514 | 5.05 | 200 | HTML 5, English |
9875 | bed-booking.com | 11515 | 5.05 | 200 | HTML 5, English |
9876 | campercontact.com | 11517 | 5.05 | 200 | HTML 5, No Lang |
9877 | disabilityin.org | 11518 | 5.05 | 200 | HTML 5, English |
9878 | framer.com | 11519 | 5.05 | 200 | HTML 5, English |
9879 | mautic.org | 11520 | 5.05 | 200 | HTML 5, English |
9880 | medindia.net | 11521 | 5.05 | 200 | HTML 5, English |
9881 | earthexplorer.usgs.gov | 11522 | 5.05 | 200 | HTML 5, English |
9882 | wkyc.com | 11523 | 5.05 | 200 | HTML 5, English |
9883 | saatchionline.com | 11524 | 5.05 | 200 | HTML 5, English |
9884 | rdw.nl | 11525 | 5.05 | 200 | HTML 5 |
9885 | redtri.com | 11526 | 5.05 | 200 | HTML 5, English |
9886 | unescap.org | 11527 | 5.05 | 200 | HTML 5, English |
9887 | react.semantic-ui.com | 11528 | 5.05 | 200 | HTML 5, English |
9888 | genome.ucsc.edu | 11529 | 5.05 | 200 | HTML 5, No Lang |
9889 | thuvienphapluat.vn | 11530 | 5.05 | 200 | No Lang, Transitional |
9890 | sciway.net | 11531 | 5.05 | 200 | HTML 5, No Lang |
9891 | oecd-nea.org | 11532 | 5.05 | 200 | HTML 5, English |
9892 | worldofwarcraft.com | 11533 | 5.05 | 200 | HTML 5, English |
9893 | dictionary.apa.org | 11534 | 5.05 | 200 | HTML 5, English |
9894 | tablericons.com | 11535 | 5.05 | 200 | HTML 5, English |
9895 | vix.com | 11536 | 5.05 | 200 | HTML 5, No Lang |
9896 | gettyimages.it | 11537 | 5.05 | 200 | HTML 5 |
9897 | earth911.com | 11538 | 5.05 | 200 | HTML 5, English |
9898 | bulletjournal.com | 11539 | 5.05 | 200 | HTML 5, English |
9899 | barn2.co.uk | 11540 | 5.05 | 200 | HTML 5, English |
9900 | gps.gov | 11541 | 5.05 | 200 | English, Transitional |
Data from: Open PageRank
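
The Flags column could be derived from each home page roughly as follows. This is a sketch under assumptions: the flag names come from the table, but the rules are guesses at how the flags were assigned. The assumed rules are that `HTML 5` means a bare `<!DOCTYPE html>`, `English` means a `lang` value starting with `en`, `No Lang` means no `lang` on the `<html>` tag, and `Strict`/`Transitional` are read from a legacy doctype. `classify` is a hypothetical helper, and it uses regular expressions rather than a full HTML parser.

```python
import re

# Doctype declaration, e.g. <!DOCTYPE html> or a legacy XHTML/HTML 4 doctype.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
# Attributes of the opening <html> tag.
HTML_TAG_RE = re.compile(r"<html\b([^>]*)>", re.IGNORECASE)
# A plain lang attribute; the lookbehind skips xml:lang and data-lang.
LANG_ATTR_RE = re.compile(r"""(?<![\w:-])lang\s*=\s*["']?([A-Za-z0-9-]+)""", re.IGNORECASE)


def classify(html: str) -> list[str]:
    """Return flags for a page in the (assumed) order used by the table."""
    flags = []

    doctype = DOCTYPE_RE.search(html)
    decl = doctype.group(1).strip().lower() if doctype else ""
    if decl == "html":
        flags.append("HTML 5")

    tag = HTML_TAG_RE.search(html)
    lang = LANG_ATTR_RE.search(tag.group(1)) if tag else None
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().split("-")[0] == "en":
        flags.append("English")
    # A non-English lang value adds no flag.

    if "strict" in decl:
        flags.append("Strict")
    elif "transitional" in decl:
        flags.append("Transitional")
    return flags
```

For example, `classify('<!DOCTYPE html><html lang="es">')` returns `["HTML 5"]`, which is consistent with the rows that show only `HTML 5`, assuming those pages declare a non-English language.
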