Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ranked by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and is not in the list!
- It doesn't get much traffic, but ranked below 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, valid HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML5 doctype
- The domains in the list are naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without disabling SSL verification
- With standard security checks clients will never get a redirect
- Some domains redirect from behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- That is lost revenue, given how much linking it takes to get on this list
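The doctype and lang checks described in the observations could be sketched roughly as follows. This is only an illustration, not the crawler actually used here: the function name `page_flags` and the exact rules (a bare `<!DOCTYPE html>` counts as HTML5, a lang of `en` or `en-*` counts as English) are assumptions chosen to resemble the Flags column in the table below.

```python
import re
from html.parser import HTMLParser

# Assumption: only the short "<!DOCTYPE html>" form is treated as HTML5.
HTML5_DOCTYPE = re.compile(r"\s*<!doctype\s+html\s*>", re.IGNORECASE)


class _LangFinder(HTMLParser):
    """Records the lang attribute of the first <html> start tag."""

    def __init__(self):
        super().__init__()
        self.lang = None
        self.seen_html = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


def page_flags(html: str) -> list[str]:
    """Return flags such as "HTML 5", "English", "No Lang" (hypothetical rules)."""
    flags = []
    # Ignore a leading byte-order mark before looking for the doctype.
    if HTML5_DOCTYPE.match(html.lstrip("\ufeff")):
        flags.append("HTML 5")

    finder = _LangFinder()
    finder.feed(html)
    lang = (finder.lang or "").strip().lower()
    if not lang:
        flags.append("No Lang")
    elif lang == "en" or lang.startswith("en-"):
        flags.append("English")
    # Any other non-empty lang value adds no flag in this sketch.
    return flags
```

For example, `page_flags('<!DOCTYPE html><html lang="en-US"></html>')` yields both the HTML5 and English flags, while a page with an HTML 4.01 doctype and no lang attribute yields only "No Lang".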
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
7201 | globalcitizen.org | 8418 | 5.16 | 200 | HTML 5, No Lang |
7202 | community.atlassian.com | 8419 | 5.16 | 200 | HTML 5, English |
7203 | cex.io | 8421 | 5.16 | 200 | HTML 5, English |
7204 | stocktwits.com | 8422 | 5.16 | 200 | HTML 5, English |
7205 | restaurant.com | 8423 | 5.16 | 200 | HTML 5, English |
7206 | childmind.org | 8424 | 5.16 | 200 | HTML 5, English |
7207 | nichd.nih.gov | 8425 | 5.16 | 200 | HTML 5, English |
7208 | cyclingweekly.com | 8426 | 5.16 | 200 | HTML 5, English |
7209 | foodrepublic.com | 8427 | 5.16 | 200 | HTML 5, English |
7210 | brendangregg.com | 8428 | 5.16 | 200 | No Lang |
7211 | spreadshop.com | 8430 | 5.16 | 200 | HTML 5, English |
7212 | guttmacher.org | 8431 | 5.16 | 200 | HTML 5, English |
7213 | houstonpress.com | 8432 | 5.16 | 200 | HTML 5, English |
7214 | koreaherald.com | 8433 | 5.16 | 200 | HTML 5, English |
7215 | techxplore.com | 8434 | 5.16 | 200 | HTML 5, English |
7216 | positivepsychology.com | 8435 | 5.16 | 200 | HTML 5, English |
7217 | digitalcommons.unl.edu | 8436 | 5.16 | 200 | HTML 5, English |
7218 | knowledge.autodesk.com | 8437 | 5.16 | 200 | HTML 5, English |
7219 | ember-climate.org | 8438 | 5.16 | 200 | HTML 5, English |
7220 | wagwalking.com | 8439 | 5.16 | 200 | HTML 5, English |
7221 | themoscowtimes.com | 8440 | 5.16 | 200 | HTML 5, English |
7222 | phpied.com | 8441 | 5.16 | 200 | HTML 5, No Lang |
7223 | roots.io | 8442 | 5.16 | 200 | HTML 5, English |
7224 | observador.pt | 8443 | 5.16 | 200 | HTML 5 |
7225 | trt.net.tr | 8444 | 5.16 | 200 | HTML 5 |
7226 | upworthy.com | 8445 | 5.16 | 200 | HTML 5, English |
7227 | datacite.org | 8446 | 5.16 | 200 | HTML 5, English |
7228 | psiphon.ca | 8447 | 5.16 | 200 | HTML 5, No Lang |
7229 | thediplomat.com | 8449 | 5.16 | 200 | HTML 5, English |
7230 | 16personalities.com | 8450 | 5.16 | 200 | HTML 5, English |
7231 | blackenterprise.com | 8451 | 5.16 | 200 | HTML 5, English |
7232 | njit.edu | 8452 | 5.16 | 200 | HTML 5, English |
7233 | francebleu.fr | 8453 | 5.16 | 200 | HTML 5 |
7234 | mygov.in | 8454 | 5.16 | 200 | HTML 5, English |
7235 | conceptcarz.com | 8455 | 5.16 | 200 | HTML 5, English |
7236 | leapmotion.com | 8456 | 5.16 | 200 | HTML 5, English |
7237 | spatie.be | 8458 | 5.16 | 200 | HTML 5, English |
7238 | pixilart.com | 8459 | 5.16 | 200 | HTML 5, English |
7239 | videolectures.net | 8460 | 5.16 | 200 | HTML 5, English |
7240 | indiatvnews.com | 8461 | 5.16 | 200 | HTML 5, English |
7241 | template.net | 8462 | 5.16 | 200 | HTML 5, English |
7242 | isi.edu | 8463 | 5.16 | 200 | HTML 5, English |
7243 | maps.google.ca | 8464 | 5.16 | 200 | HTML 5, English |
7244 | swift.com | 8465 | 5.16 | 200 | HTML 5, English |
7245 | blogs.cisco.com | 8466 | 5.16 | 200 | HTML 5, English |
7246 | plants.usda.gov | 8467 | 5.16 | 200 | HTML 5, English |
7247 | netsuite.com | 8468 | 5.16 | 200 | HTML 5, English |
7248 | dove.com | 8469 | 5.16 | 200 | HTML 5, English |
7249 | journals.ametsoc.org | 8470 | 5.16 | 200 | HTML 5, English |
7250 | nationalgallery.org.uk | 8471 | 5.16 | 200 | HTML 5, English |
7251 | agenciabrasil.ebc.com.br | 8474 | 5.16 | 200 | HTML 5 |
7252 | cancerres.aacrjournals.org | 8475 | 5.16 | 200 | No Lang |
7253 | svelte.dev | 8476 | 5.16 | 200 | HTML 5, English |
7254 | developer.vimeo.com | 8477 | 5.16 | 200 | HTML 5, English |
7255 | allmylinks.com | 8478 | 5.15 | 200 | HTML 5, English |
7256 | scholar.google.com.au | 8479 | 5.15 | 200 | HTML 5, No Lang |
7257 | ci.nii.ac.jp | 8480 | 5.15 | 200 | HTML 5, English |
7258 | extensiblewebmanifesto.org | 8481 | 5.15 | 200 | HTML 5, No Lang |
7259 | branch.io | 8482 | 5.15 | 200 | HTML 5, English |
7260 | cyberscoop.com | 8484 | 5.15 | 200 | HTML 5, English |
7261 | xerox.com | 8485 | 5.15 | 200 | HTML 5, English |
7262 | childrenshospital.org | 8487 | 5.15 | 200 | HTML 5, English |
7263 | goodtherapy.org | 8488 | 5.15 | 200 | HTML 5, English |
7264 | 247wallst.com | 8489 | 5.15 | 200 | HTML 5, English |
7265 | clickorlando.com | 8490 | 5.15 | 200 | HTML 5, English |
7266 | omaha.com | 8492 | 5.15 | 200 | HTML 5, English |
7267 | thegreatcourses.com | 8493 | 5.15 | 200 | HTML 5, English |
7268 | pressherald.com | 8494 | 5.15 | 200 | HTML 5, No Lang |
7269 | unimelb.edu.au | 8495 | 5.15 | 200 | HTML 5, English |
7270 | npg.org.uk | 8496 | 5.15 | 200 | HTML 5, English |
7271 | nabu.de | 8497 | 5.15 | 200 | HTML 5 |
7272 | policylink.org | 8499 | 5.15 | 200 | HTML 5, English |
7273 | trustedreviews.com | 8502 | 5.15 | 200 | HTML 5, English |
7274 | canlii.org | 8503 | 5.15 | 200 | HTML 5, English |
7275 | bmo.com | 8504 | 5.15 | 200 | HTML 5, English |
7276 | goodnewsnetwork.org | 8506 | 5.15 | 200 | English |
7277 | cseweb.ucsd.edu | 8507 | 5.15 | 200 | HTML 5, English |
7278 | n.news.naver.com | 8508 | 5.15 | 200 | HTML 5 |
7279 | csee.umbc.edu | 8509 | 5.15 | 200 | HTML 5, English |
7280 | inspirehep.net | 8510 | 5.15 | 200 | HTML 5, English |
7281 | daisycon.com | 8511 | 5.15 | 200 | HTML 5, English |
7282 | wmagazine.com | 8512 | 5.15 | 200 | HTML 5, English |
7283 | madamenoire.com | 8513 | 5.15 | 200 | HTML 5, English |
7284 | fox9.com | 8514 | 5.15 | 200 | HTML 5, English |
7285 | opensource.google.com | 8515 | 5.15 | 200 | HTML 5, English |
7286 | rnib.org.uk | 8516 | 5.15 | 200 | HTML 5, English |
7287 | ch.linkedin.com | 8517 | 5.15 | 200 | HTML 5 |
7288 | plagiarismtoday.com | 8518 | 5.15 | 200 | HTML 5, English |
7289 | instant.page | 8519 | 5.15 | 200 | HTML 5, English |
7290 | forms.monday.com | 8520 | 5.15 | 200 | HTML 5, English |
7291 | flock.com | 8522 | 5.15 | 200 | HTML 5, English |
7292 | wnd.com | 8523 | 5.15 | 200 | HTML 5, English |
7293 | ovh.com | 8524 | 5.15 | 200 | HTML 5, English |
7294 | hal.inria.fr | 8525 | 5.15 | 200 | HTML 5, English |
7295 | le.utah.gov | 8526 | 5.15 | 200 | HTML 5, English |
7296 | uchicago.edu | 8527 | 5.15 | 200 | HTML 5, English |
7297 | framebridge.com | 8528 | 5.15 | 200 | HTML 5, English |
7298 | mlb.mlb.com | 8529 | 5.15 | 200 | HTML 5, English |
7299 | saude.abril.com.br | 8530 | 5.15 | 200 | HTML 5 |
7300 | discoverlosangeles.com | 8532 | 5.15 | 200 | HTML 5, English |
Data from: Open PageRank