Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
9401 | shoutout.wix.com | 10950 | 5.07 | 200 | HTML 5, English |
9402 | cmt.com | 10951 | 5.07 | 200 | HTML 5, English |
9403 | wels.net | 10952 | 5.07 | 200 | HTML 5, English |
9404 | ext.vt.edu | 10954 | 5.07 | 200 | HTML 5, English |
9405 | metropcs.com | 10955 | 5.07 | 200 | HTML 5, English |
9406 | digital.gov | 10956 | 5.07 | 200 | No Lang |
9407 | ojs.aaai.org | 10957 | 5.07 | 200 | HTML 5, English |
9408 | worldtimebuddy.com | 10958 | 5.07 | 200 | English, Transitional |
9409 | yelp.ca | 10959 | 5.07 | 200 | HTML 5, English |
9410 | theyworkforyou.com | 10961 | 5.07 | 200 | HTML 5, English |
9411 | plus.maths.org | 10962 | 5.07 | 200 | HTML 5, English |
9412 | doxygen.nl | 10963 | 5.07 | 200 | HTML 5, English |
9413 | getmyboat.com | 10965 | 5.07 | 200 | HTML 5, English |
9414 | royal.uk | 10966 | 5.07 | 200 | HTML 5, English |
9415 | academy.binance.com | 10967 | 5.07 | 200 | HTML 5, English |
9416 | joinup.ec.europa.eu | 10968 | 5.07 | 200 | HTML 5, English |
9417 | mynet.co.jp | 10970 | 5.07 | 200 | HTML 5 |
9418 | beforeitsnews.com | 10971 | 5.07 | 200 | HTML 5, English |
9419 | jio.com | 10972 | 5.07 | 200 | HTML 5, English |
9420 | thinkific.com | 10973 | 5.07 | 200 | HTML 5, English |
9421 | jhuapl.edu | 10975 | 5.07 | 200 | HTML 5, English |
9422 | ghirardelli.com | 10976 | 5.07 | 200 | HTML 5, English |
9423 | new.mta.info | 10977 | 5.07 | 200 | English |
9424 | uni-bamberg.de | 10978 | 5.07 | 200 | HTML 5 |
9425 | mapsplatform.googleblog.com | 10980 | 5.07 | 200 | HTML 5, English |
9426 | pitchero.com | 10981 | 5.07 | 200 | HTML 5, English |
9427 | habitica.com | 10982 | 5.07 | 200 | HTML 5, No Lang |
9428 | m.bild.de | 10984 | 5.07 | 200 | HTML 5 |
9429 | dashif.org | 10985 | 5.07 | 200 | English |
9430 | masters.com | 10986 | 5.07 | 200 | HTML 5, English |
9431 | depatisnet.dpma.de | 10987 | 5.07 | 200 | HTML 5, No Lang |
9432 | dec.ny.gov | 10988 | 5.07 | 200 | HTML 5, English |
9433 | id3.org | 10989 | 5.07 | 200 | No Lang, Strict |
9434 | therealdeal.com | 10990 | 5.07 | 200 | HTML 5, English |
9435 | en.forums.wordpress.com | 10992 | 5.07 | 200 | HTML 5, No Lang |
9436 | electronicintifada.net | 10994 | 5.07 | 200 | HTML 5, English |
9437 | tcl.fr | 10995 | 5.07 | 200 | HTML 5 |
9438 | nceas.ucsb.edu | 10996 | 5.07 | 200 | HTML 5, English |
9439 | kryogenix.org | 10997 | 5.07 | 200 | HTML 5, English |
9440 | legis.la.gov | 10998 | 5.07 | 200 | HTML 5, English |
9441 | blendle.com | 10999 | 5.07 | 200 | HTML 5, English |
9442 | julialang.org | 11000 | 5.07 | 200 | HTML 5, English |
9443 | orange.pl | 11001 | 5.07 | 200 | HTML 5 |
9444 | openssh.com | 11002 | 5.07 | 200 | HTML 5, English |
9445 | inkitt.com | 11003 | 5.07 | 200 | HTML 5, English |
9446 | swift.org | 11004 | 5.07 | 200 | HTML 5, English |
9447 | ec.gc.ca | 11005 | 5.07 | 200 | HTML 5, English |
9448 | moh.gov.sg | 11006 | 5.07 | 200 | HTML 5, English |
9449 | aufeminin.com | 11007 | 5.07 | 200 | HTML 5 |
9450 | u-blox.com | 11008 | 5.07 | 200 | HTML 5, English |
9451 | trusona.com | 11009 | 5.07 | 200 | HTML 5, English |
9452 | websummit.com | 11010 | 5.07 | 200 | HTML 5, English |
9453 | latercera.com | 11011 | 5.07 | 200 | HTML 5, No Lang |
9454 | ee.co.uk | 11012 | 5.07 | 200 | HTML 5, English |
9455 | archives.nd.edu | 11013 | 5.07 | 200 | HTML 5, No Lang |
9456 | standardmedia.co.ke | 11014 | 5.07 | 200 | HTML 5, English |
9457 | sanook.com | 11015 | 5.07 | 200 | HTML 5 |
9458 | rspb.org.uk | 11016 | 5.07 | 200 | HTML 5, English |
9459 | trac.osgeo.org | 11017 | 5.07 | 200 | No Lang |
9460 | cs.purdue.edu | 11019 | 5.07 | 200 | HTML 5, English |
9461 | secunia.com | 11020 | 5.07 | 200 | HTML 5, English |
9462 | aliceblueonline.com | 11021 | 5.07 | 200 | HTML 5, English |
9463 | jaxx.io | 11022 | 5.07 | 200 | HTML 5, English |
9464 | uni-bielefeld.de | 11023 | 5.07 | 200 | HTML 5, English |
9465 | cims.nyu.edu | 11024 | 5.07 | 200 | HTML 5, English |
9466 | openrouter.ai | 11025 | 5.07 | 200 | HTML 5, English |
9467 | play.google | 11026 | 5.07 | 200 | HTML 5, English |
9468 | global.sharp | 11027 | 5.07 | 200 | HTML 5, English |
9469 | gstreamer.freedesktop.org | 11029 | 5.07 | 200 | No Lang |
9470 | docs.like.co | 11030 | 5.07 | 200 | HTML 5, English |
9471 | frag-mutti.de | 11031 | 5.07 | 200 | HTML 5 |
9472 | hug-ge.ch | 11032 | 5.07 | 200 | HTML 5 |
9473 | imagecomics.com | 11033 | 5.07 | 200 | HTML 5, English |
9474 | guru3d.com | 11035 | 5.07 | 200 | English |
9475 | illinois.gov | 11036 | 5.07 | 200 | HTML 5, English |
9476 | timharford.com | 11037 | 5.07 | 200 | HTML 5, English |
9477 | loomio.org | 11038 | 5.07 | 200 | HTML 5, English |
9478 | mid-day.com | 11039 | 5.07 | 200 | English |
9479 | docs.ansible.com | 11040 | 5.07 | 200 | HTML 5, English |
9480 | ekantipur.com | 11041 | 5.07 | 200 | HTML 5 |
9481 | en.interfax.com.ua | 11042 | 5.07 | 200 | HTML 5, English |
9482 | researcherid.com | 11044 | 5.07 | 200 | HTML 5, English |
9483 | empik.com | 11045 | 5.07 | 200 | HTML 5 |
9484 | plone.org | 11047 | 5.07 | 200 | HTML 5, English |
9485 | techpowerup.com | 11049 | 5.05 | 200 | HTML 5, English |
9486 | data.bls.gov | 11051 | 5.05 | 200 | HTML 5, English |
9487 | scottberkun.com | 11052 | 5.05 | 200 | HTML 5, English |
9488 | teamsystem.com | 11053 | 5.05 | 200 | |
9489 | stepik.org | 11054 | 5.05 | 200 | HTML 5, No Lang |
9490 | desalasworks.com | 11056 | 5.05 | 200 | HTML 5, English |
9491 | vendeeglobe.org | 11058 | 5.05 | 200 | HTML 5 |
9492 | feastingathome.com | 11060 | 5.05 | 200 | HTML 5, English |
9493 | wordhippo.com | 11061 | 5.05 | 200 | HTML 5, English |
9494 | ruhr-uni-bochum.de | 11062 | 5.05 | 200 | HTML 5 |
9495 | scryfall.com | 11064 | 5.05 | 200 | HTML 5, English |
9496 | docs.plesk.com | 11065 | 5.05 | 200 | HTML 5, English |
9497 | moosend.com | 11066 | 5.05 | 200 | HTML 5, English |
9498 | ledevoir.com | 11068 | 5.05 | 200 | HTML 5 |
9499 | kgw.com | 11069 | 5.05 | 200 | HTML 5, English |
9500 | sidn.nl | 11070 | 5.05 | 200 | HTML 5 |
Data from: Open PageRank