Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
8301 | sonypictures.com | 9690 | 5.11 | 200 | HTML 5, English |
8302 | ada.org | 9691 | 5.11 | 200 | HTML 5, No Lang |
8303 | isotope.metafizzy.co | 9692 | 5.11 | 200 | HTML 5, English |
8304 | melia.com | 9693 | 5.11 | 200 | HTML 5, English |
8305 | gaana.com | 9694 | 5.11 | 200 | HTML 5, English |
8306 | spokeo.com | 9695 | 5.11 | 200 | HTML 5, English |
8307 | mastodon.bida.im | 9696 | 5.11 | 200 | HTML 5, English |
8308 | usanetwork.com | 9697 | 5.11 | 200 | HTML 5, English |
8309 | orange.com | 9698 | 5.11 | 200 | HTML 5, English |
8310 | isap.sejm.gov.pl | 9699 | 5.11 | 200 | HTML 5, English |
8311 | 36kr.com | 9700 | 5.11 | 200 | HTML 5, No Lang |
8312 | mnot.net | 9702 | 5.11 | 200 | HTML 5, English |
8313 | visme.co | 9703 | 5.11 | 200 | HTML 5, English |
8314 | clarifai.com | 9704 | 5.11 | 200 | HTML 5, English |
8315 | thesundaily.my | 9706 | 5.11 | 200 | HTML 5, English |
8316 | syracuse.com | 9707 | 5.11 | 200 | HTML 5, English |
8317 | cnnespanol.cnn.com | 9708 | 5.11 | 200 | HTML 5 |
8318 | informatik.uni-trier.de | 9709 | 5.11 | 200 | HTML 5 |
8319 | ulb.ac.be | 9710 | 5.11 | 200 | HTML 5 |
8320 | codezine.jp | 9711 | 5.11 | 200 | HTML 5 |
8321 | developer.arm.com | 9713 | 5.11 | 200 | HTML 5, English |
8322 | wfla.com | 9714 | 5.11 | 200 | HTML 5, English |
8323 | opencorporates.com | 9715 | 5.11 | 200 | No Lang |
8324 | bournemouthecho.co.uk | 9716 | 5.11 | 200 | HTML 5, English |
8325 | destination360.com | 9717 | 5.11 | 200 | HTML 5, English |
8326 | pdffiller.com | 9719 | 5.11 | 200 | HTML 5, English |
8327 | excite.co.jp | 9720 | 5.11 | 200 | HTML 5 |
8328 | churchofengland.org | 9721 | 5.11 | 200 | HTML 5, English |
8329 | tubebuddy.com | 9722 | 5.11 | 200 | HTML 5, English |
8330 | srpnet.com | 9723 | 5.11 | 200 | HTML 5, English |
8331 | imls.gov | 9724 | 5.11 | 200 | English |
8332 | mural.co | 9725 | 5.11 | 200 | HTML 5, English |
8333 | tablespoon.com | 9726 | 5.11 | 200 | HTML 5, English |
8334 | infosec.exchange | 9727 | 5.11 | 200 | HTML 5, English |
8335 | codame.com | 9728 | 5.11 | 200 | HTML 5, English |
8336 | schiphol.nl | 9729 | 5.11 | 200 | HTML 5, English |
8337 | stheadline.com | 9730 | 5.11 | 200 | HTML 5 |
8338 | ineteconomics.org | 9731 | 5.11 | 200 | HTML 5, English |
8339 | manager-magazin.de | 9733 | 5.11 | 200 | HTML 5 |
8340 | papermag.com | 9734 | 5.11 | 200 | HTML 5, English |
8341 | cpomagazine.com | 9735 | 5.11 | 200 | HTML 5, English |
8342 | llvm.org | 9736 | 5.11 | 200 | No Lang, Strict |
8343 | brusselsairlines.com | 9737 | 5.11 | 200 | HTML 5, English |
8344 | sleepcycle.com | 9738 | 5.11 | 200 | HTML 5, English |
8345 | concordia.ca | 9739 | 5.11 | 200 | HTML 5, English |
8346 | wits.ac.za | 9740 | 5.11 | 200 | HTML 5, English |
8347 | stereophile.com | 9741 | 5.11 | 200 | English, Transitional |
8348 | cs.ru.nl | 9742 | 5.11 | 200 | No Lang |
8349 | tophat.com | 9744 | 5.11 | 200 | HTML 5, English |
8350 | indico.cern.ch | 9745 | 5.11 | 200 | HTML 5, English |
8351 | books.google.nl | 9746 | 5.11 | 200 | HTML 5, No Lang |
8352 | mapicons.mapsmarker.com | 9747 | 5.11 | 200 | HTML 5, English |
8353 | linuxmint.com | 9748 | 5.11 | 200 | HTML 5, English |
8354 | harpercollins.ca | 9749 | 5.11 | 200 | HTML 5, English |
8355 | xeno-canto.org | 9750 | 5.11 | 200 | HTML 5, No Lang |
8356 | cdn.datatables.net | 9751 | 5.11 | 200 | HTML 5, English |
8357 | jambands.com | 9752 | 5.11 | 200 | HTML 5, English |
8358 | icd.who.int | 9753 | 5.11 | 200 | No Lang |
8359 | coveteur.com | 9754 | 5.11 | 200 | HTML 5, English |
8360 | gdcvault.com | 9755 | 5.11 | 200 | HTML 5, No Lang |
8361 | movieinsider.com | 9757 | 5.11 | 200 | HTML 5, English |
8362 | uab.cat | 9758 | 5.11 | 200 | HTML 5 |
8363 | nongnu.org | 9760 | 5.11 | 200 | English, Transitional |
8364 | sigchi.org | 9761 | 5.11 | 200 | HTML 5, English |
8365 | betterup.com | 9762 | 5.11 | 200 | HTML 5, English |
8366 | turkcell.com.tr | 9763 | 5.11 | 200 | HTML 5 |
8367 | makerbot.com | 9765 | 5.11 | 200 | HTML 5, English |
8368 | arbonne.com | 9766 | 5.11 | 200 | HTML 5, English |
8369 | bloodhorse.com | 9767 | 5.11 | 200 | No Lang |
8370 | support.symantec.com | 9768 | 5.11 | 200 | HTML 5, English |
8371 | vrtx.com | 9769 | 5.09 | 200 | HTML 5, English |
8372 | sorbs.net | 9771 | 5.09 | 200 | No Lang, Transitional |
8373 | conifer.rhizome.org | 9772 | 5.09 | 200 | HTML 5, English |
8374 | news.sophos.com | 9773 | 5.09 | 200 | HTML 5, English |
8375 | pygments.org | 9774 | 5.09 | 200 | HTML 5, English |
8376 | aldi.us | 9775 | 5.09 | 200 | HTML 5, English |
8377 | digilander.libero.it | 9776 | 5.09 | 200 | HTML 5 |
8378 | commerce.senate.gov | 9778 | 5.09 | 200 | HTML 5, English |
8379 | unwto.org | 9779 | 5.09 | 200 | HTML 5, English |
8380 | tel.archives-ouvertes.fr | 9780 | 5.09 | 200 | HTML 5, English |
8381 | cepal.org | 9781 | 5.09 | 200 | HTML 5 |
8382 | lenta.ru | 9782 | 5.09 | 200 | HTML 5 |
8383 | skitterphoto.com | 9783 | 5.09 | 200 | HTML 5, English |
8384 | scalar.usc.edu | 9784 | 5.09 | 200 | HTML 5, English |
8385 | scholar.google.de | 9785 | 5.09 | 200 | HTML 5, No Lang |
8386 | form.run | 9786 | 5.09 | 200 | HTML 5 |
8387 | tass.ru | 9787 | 5.09 | 200 | HTML 5 |
8388 | sno.phy.queensu.ca | 9788 | 5.09 | 200 | No Lang, Transitional |
8389 | ohloh.net | 9789 | 5.09 | 200 | HTML 5, No Lang |
8390 | et.wikipedia.org | 9791 | 5.09 | 200 | HTML 5, No Lang |
8391 | angularjs.org | 9792 | 5.09 | 200 | HTML 5, English |
8392 | spacetelescope.org | 9793 | 5.09 | 200 | HTML 5, English |
8393 | muslim-library.com | 9795 | 5.09 | 200 | HTML 5 |
8394 | scholar.google.nl | 9797 | 5.09 | 200 | HTML 5, No Lang |
8395 | smhi.se | 9798 | 5.09 | 200 | HTML 5 |
8396 | stackpath.com | 9799 | 5.09 | 200 | HTML 5, No Lang |
8397 | neb.com | 9800 | 5.09 | 200 | HTML 5, English |
8398 | vtnews.vt.edu | 9801 | 5.09 | 200 | HTML 5, English |
8399 | trekearth.com | 9802 | 5.09 | 200 | HTML 5, English |
8400 | runkeeper.com | 9803 | 5.09 | 200 | HTML 5, English |
Data from: Open PageRank