Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
8001 | earth.org | 9343 | 5.12 | 200 | HTML 5, English |
8002 | tf1info.fr | 9344 | 5.12 | 200 | HTML 5 |
8003 | openshot.org | 9346 | 5.12 | 200 | HTML 5, English |
8004 | tindie.com | 9347 | 5.12 | 200 | HTML 5, English |
8005 | idw-online.de | 9348 | 5.12 | 200 | HTML 5 |
8006 | watchguard.com | 9349 | 5.12 | 200 | HTML 5, English |
8007 | stats.govt.nz | 9350 | 5.12 | 200 | HTML 5, English |
8008 | m.twitch.tv | 9351 | 5.12 | 200 | HTML 5, No Lang |
8009 | lbry.com | 9353 | 5.12 | 200 | HTML 5, English |
8010 | codeclimate.com | 9354 | 5.12 | 200 | HTML 5, English |
8011 | wordpress.stackexchange.com | 9355 | 5.12 | 200 | HTML 5, English |
8012 | qantas.com | 9356 | 5.12 | 200 | HTML 5, English |
8013 | planoly.com | 9357 | 5.12 | 200 | HTML 5, English |
8014 | coolmaterial.com | 9358 | 5.12 | 200 | HTML 5, English |
8015 | min.io | 9359 | 5.12 | 200 | HTML 5, English |
8016 | ark.intel.com | 9360 | 5.12 | 200 | English |
8017 | media.ford.com | 9361 | 5.12 | 200 | HTML 5, English |
8018 | strikingly.com | 9362 | 5.12 | 200 | HTML 5, English |
8019 | raiders.com | 9363 | 5.12 | 200 | HTML 5, English |
8020 | prokerala.com | 9364 | 5.12 | 200 | HTML 5, No Lang |
8021 | dev.windows.com | 9365 | 5.12 | 200 | HTML 5, English |
8022 | cabi.org | 9366 | 5.12 | 200 | HTML 5, English |
8023 | hopper.com | 9367 | 5.12 | 200 | HTML 5, English |
8024 | rememberthemilk.com | 9368 | 5.12 | 200 | English, Transitional |
8025 | mapsofworld.com | 9369 | 5.12 | 200 | HTML 5, English |
8026 | fu-berlin.de | 9370 | 5.12 | 200 | HTML 5 |
8027 | wapo.st | 9371 | 5.12 | 200 | HTML 5, English |
8028 | kabbage.com | 9372 | 5.12 | 200 | HTML 5, No Lang |
8029 | aflcio.org | 9374 | 5.12 | 200 | HTML 5, English |
8030 | thefader.com | 9376 | 5.12 | 200 | HTML 5, English |
8031 | london.edu | 9377 | 5.12 | 200 | HTML 5, English |
8032 | teamrubiconusa.org | 9378 | 5.12 | 200 | HTML 5, English |
8033 | storage.courtlistener.com | 9380 | 5.12 | 200 | No Lang |
8034 | masonry.desandro.com | 9381 | 5.12 | 200 | HTML 5, English |
8035 | pythonhosted.org | 9383 | 5.12 | 200 | No Lang |
8036 | opg.optica.org | 9384 | 5.12 | 200 | HTML 5, English |
8037 | dyn.com | 9385 | 5.12 | 200 | HTML 5, English |
8038 | blog.usejournal.com | 9386 | 5.12 | 200 | HTML 5, English |
8039 | asc.upenn.edu | 9387 | 5.12 | 200 | HTML 5, English |
8040 | nida.nih.gov | 9388 | 5.12 | 200 | HTML 5, English |
8041 | fi.google.com | 9389 | 5.12 | 200 | HTML 5, English |
8042 | lomography.com | 9390 | 5.12 | 200 | HTML 5, English |
8043 | valvesoftware.com | 9391 | 5.12 | 200 | HTML 5, No Lang |
8044 | tue.nl | 9392 | 5.12 | 200 | HTML 5, English |
8045 | kfw.de | 9393 | 5.12 | 200 | HTML 5 |
8046 | courses.lumenlearning.com | 9394 | 5.12 | 200 | HTML 5, English |
8047 | city.ac.uk | 9395 | 5.12 | 200 | HTML 5, English |
8048 | lsst.org | 9396 | 5.12 | 200 | HTML 5, English |
8049 | donotcall.gov | 9397 | 5.12 | 200 | HTML 5, No Lang |
8050 | theory.stanford.edu | 9398 | 5.12 | 200 | No Lang |
8051 | firstvoices.com | 9399 | 5.12 | 200 | HTML 5, English |
8052 | environment.ec.europa.eu | 9400 | 5.12 | 200 | HTML 5, English |
8053 | zebra.com | 9401 | 5.12 | 200 | HTML 5, English |
8054 | flowwow.com | 9402 | 5.12 | 200 | HTML 5, No Lang |
8055 | umap.openstreetmap.fr | 9403 | 5.12 | 200 | HTML 5, No Lang |
8056 | kik.com | 9404 | 5.12 | 200 | HTML 5, English |
8057 | cs.berkeley.edu | 9405 | 5.12 | 200 | HTML 5, English |
8058 | nidirect.gov.uk | 9406 | 5.12 | 200 | HTML 5, English |
8059 | join.me | 9407 | 5.12 | 200 | HTML 5, English |
8060 | coop.co.uk | 9408 | 5.12 | 200 | HTML 5, No Lang |
8061 | skincancer.org | 9409 | 5.12 | 200 | HTML 5, English |
8062 | nominatim.org | 9410 | 5.12 | 200 | HTML 5, No Lang |
8063 | dpi.nsw.gov.au | 9411 | 5.12 | 200 | HTML 5, English |
8064 | nea.com | 9413 | 5.12 | 200 | HTML 5, English |
8065 | myvi.in | 9414 | 5.12 | 200 | HTML 5, English |
8066 | reed.co.uk | 9415 | 5.12 | 200 | HTML 5, English |
8067 | actualitte.com | 9416 | 5.12 | 200 | HTML 5, No Lang |
8068 | flashbak.com | 9417 | 5.12 | 200 | HTML 5, English |
8069 | conrad.de | 9419 | 5.12 | 200 | HTML 5 |
8070 | cnblogs.com | 9420 | 5.12 | 200 | HTML 5 |
8071 | x-plane.com | 9421 | 5.12 | 200 | HTML 5, English |
8072 | gpsies.com | 9422 | 5.12 | 200 | HTML 5, English |
8073 | edsource.org | 9423 | 5.12 | 200 | HTML 5, English |
8074 | wbez.org | 9424 | 5.12 | 200 | HTML 5, English |
8075 | gratisography.com | 9425 | 5.12 | 200 | HTML 5, English |
8076 | flexmls.com | 9426 | 5.12 | 200 | HTML 5, English |
8077 | gatech.edu | 9427 | 5.12 | 200 | HTML 5, English |
8078 | excelsior.com.mx | 9428 | 5.12 | 200 | HTML 5 |
8079 | infogr.am | 9429 | 5.12 | 200 | HTML 5, English |
8080 | scania.com | 9430 | 5.12 | 200 | HTML 5, English |
8081 | bbfc.co.uk | 9431 | 5.12 | 200 | HTML 5, English |
8082 | constitution.org | 9432 | 5.12 | 200 | HTML 5, English |
8083 | aerisweather.com | 9434 | 5.12 | 200 | HTML 5, English |
8084 | iter.org | 9436 | 5.12 | 200 | HTML 5, English |
8085 | abc7ny.com | 9437 | 5.12 | 200 | HTML 5, English |
8086 | tasteofcountry.com | 9438 | 5.12 | 200 | HTML 5, English |
8087 | iza.org | 9440 | 5.12 | 200 | HTML 5, English |
8088 | dkfz.de | 9441 | 5.12 | 200 | HTML 5 |
8089 | uis.unesco.org | 9442 | 5.12 | 200 | HTML 5, English |
8090 | dwr.com | 9443 | 5.12 | 200 | HTML 5, English |
8091 | webmaster-source.com | 9445 | 5.12 | 200 | HTML 5, No Lang |
8092 | rtcg.me | 9446 | 5.12 | 200 | HTML 5, English |
8093 | robinhood.com | 9447 | 5.12 | 200 | HTML 5, No Lang |
8094 | hls.harvard.edu | 9448 | 5.12 | 200 | HTML 5, English |
8095 | win.tue.nl | 9449 | 5.12 | 200 | HTML 5, English |
8096 | app.grammarly.com | 9450 | 5.12 | 200 | HTML 5, English |
8097 | incubator.apache.org | 9452 | 5.12 | 200 | HTML 5, English |
8098 | www-128.ibm.com | 9453 | 5.12 | 200 | HTML 5, English |
8099 | organicmaps.app | 9454 | 5.12 | 200 | HTML 5, English |
8100 | nparks.gov.sg | 9455 | 5.12 | 200 | HTML 5, English |
Data from: Open PageRank