Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
6301 | googleonlinesecurity.blogspot.com | 7392 | 5.20 | 200 | HTML 5, English |
6302 | deliveroo.co.uk | 7393 | 5.20 | 200 | HTML 5, English |
6303 | fireeye.com | 7394 | 5.20 | 200 | HTML 5, No Lang |
6304 | cs.ox.ac.uk | 7395 | 5.20 | 200 | HTML 5, No Lang |
6305 | goaheadtours.com | 7396 | 5.20 | 200 | HTML 5, English |
6306 | nostr.how | 7397 | 5.20 | 200 | HTML 5, English |
6307 | tv4.se | 7398 | 5.20 | 200 | HTML 5 |
6308 | codelabs.developers.google.com | 7399 | 5.20 | 200 | HTML 5, English |
6309 | bjs.gov | 7400 | 5.20 | 200 | HTML 5, English |
6310 | sanrio.com | 7401 | 5.20 | 200 | HTML 5, English |
6311 | polskieradio.pl | 7402 | 5.20 | 200 | HTML 5 |
6312 | eu.usatoday.com | 7403 | 5.20 | 200 | HTML 5, English |
6313 | parachutehome.com | 7404 | 5.20 | 200 | HTML 5, English |
6314 | open-mpi.org | 7406 | 5.20 | 200 | No Lang, Strict |
6315 | th.wikipedia.org | 7407 | 5.20 | 200 | HTML 5, No Lang |
6316 | americanbanker.com | 7408 | 5.20 | 200 | HTML 5, English |
6317 | addons.thunderbird.net | 7409 | 5.20 | 200 | HTML 5, English |
6318 | app.getresponse.com | 7410 | 5.20 | 200 | HTML 5, English |
6319 | wordpress.tv | 7411 | 5.20 | 200 | HTML 5, English |
6320 | cs.nyu.edu | 7412 | 5.20 | 200 | No Lang |
6321 | wral.com | 7413 | 5.20 | 200 | HTML 5, English |
6322 | di.se | 7414 | 5.20 | 200 | HTML 5 |
6323 | allenai.org | 7415 | 5.20 | 200 | HTML 5, English |
6324 | queue.acm.org | 7417 | 5.20 | 200 | No Lang, Strict |
6325 | yourstory.com | 7418 | 5.20 | 200 | HTML 5, English |
6326 | pages.cs.wisc.edu | 7419 | 5.20 | 200 | English |
6327 | lshtm.ac.uk | 7420 | 5.20 | 200 | HTML 5, English |
6328 | france3-regions.francetvinfo.fr | 7421 | 5.20 | 200 | HTML 5 |
6329 | dai.ly | 7422 | 5.20 | 200 | HTML 5, English |
6330 | lumosity.com | 7423 | 5.20 | 200 | HTML 5, English |
6331 | chicago.gov | 7425 | 5.20 | 200 | HTML 5, English |
6332 | mobiloud.com | 7426 | 5.20 | 200 | HTML 5, English |
6333 | esa.un.org | 7427 | 5.20 | 200 | HTML 5, English |
6334 | journalofethics.ama-assn.org | 7428 | 5.20 | 200 | HTML 5, English |
6335 | wallethub.com | 7430 | 5.20 | 200 | English |
6336 | news9.com | 7432 | 5.20 | 200 | HTML 5, English |
6337 | skysports.com | 7433 | 5.20 | 200 | HTML 5, English |
6338 | mastodon.world | 7434 | 5.20 | 200 | HTML 5, English |
6339 | bricklink.com | 7435 | 5.20 | 200 | HTML 5, English |
6340 | weareteachers.com | 7436 | 5.20 | 200 | HTML 5, English |
6341 | toot.io | 7437 | 5.20 | 200 | HTML 5, English |
6342 | techreport.com | 7438 | 5.20 | 200 | HTML 5, English |
6343 | unacademy.com | 7439 | 5.20 | 200 | HTML 5, English |
6344 | msdmanuals.com | 7442 | 5.20 | 200 | HTML 5, English |
6345 | attack.mitre.org | 7443 | 5.20 | 200 | HTML 5, English |
6346 | ubos.org | 7444 | 5.20 | 200 | HTML 5, English |
6347 | x.ai | 7446 | 5.20 | 200 | HTML 5, English |
6348 | slack-files.com | 7447 | 5.20 | 200 | HTML 5, English |
6349 | securityboulevard.com | 7448 | 5.20 | 200 | HTML 5, English |
6350 | designmuseum.org | 7449 | 5.20 | 200 | HTML 5, English |
6351 | link.aps.org | 7450 | 5.20 | 200 | HTML 5, English |
6352 | fcbayern.com | 7451 | 5.20 | 200 | HTML 5, English |
6353 | download.oracle.com | 7452 | 5.20 | 200 | HTML 5, English |
6354 | fxnetworks.com | 7453 | 5.20 | 200 | HTML 5, English |
6355 | free-now.com | 7454 | 5.20 | 200 | HTML 5, English |
6356 | dm.de | 7455 | 5.20 | 200 | HTML 5 |
6357 | lazada.sg | 7456 | 5.20 | 200 | HTML 5, No Lang |
6358 | nczonline.net | 7457 | 5.20 | 200 | HTML 5, English |
6359 | techpilipinas.com | 7458 | 5.20 | 200 | HTML 5, English |
6360 | elledecor.com | 7459 | 5.20 | 200 | HTML 5, English |
6361 | dbpedia.org | 7460 | 5.20 | 200 | HTML 5, English |
6362 | cinematreasures.org | 7462 | 5.20 | 200 | HTML 5, No Lang |
6363 | synthtopia.com | 7463 | 5.20 | 200 | HTML 5, English |
6364 | simonsfoundation.org | 7464 | 5.20 | 200 | HTML 5, English |
6365 | people.howstuffworks.com | 7465 | 5.20 | 200 | HTML 5, English |
6366 | podcastindex.org | 7466 | 5.20 | 200 | HTML 5, English |
6367 | business-humanrights.org | 7467 | 5.20 | 200 | HTML 5, English |
6368 | research.net | 7469 | 5.20 | 200 | HTML 5, English |
6369 | upf.edu | 7470 | 5.20 | 200 | HTML 5 |
6370 | brennancenter.org | 7472 | 5.20 | 200 | HTML 5, English |
6371 | paramountplus.com | 7474 | 5.20 | 200 | HTML 5, English |
6372 | baylor.edu | 7475 | 5.20 | 200 | HTML 5, English |
6373 | mic.com | 7476 | 5.20 | 200 | HTML 5, English |
6374 | idpf.org | 7477 | 5.20 | 200 | English |
6375 | okx.com | 7478 | 5.20 | 200 | HTML 5, English |
6376 | eurosport.com | 7479 | 5.20 | 200 | HTML 5, English |
6377 | schedule.sxsw.com | 7480 | 5.20 | 200 | HTML 5, English |
6378 | modsecurity.org | 7481 | 5.20 | 200 | HTML 5, English |
6379 | patchstack.com | 7482 | 5.20 | 200 | HTML 5, English |
6380 | ubuntuforums.org | 7484 | 5.20 | 200 | English, Transitional |
6381 | infosys.com | 7485 | 5.20 | 200 | HTML 5, English |
6382 | clas.uiowa.edu | 7486 | 5.20 | 200 | HTML 5, English |
6383 | betterment.com | 7487 | 5.20 | 200 | HTML 5, English |
6384 | leg.colorado.gov | 7488 | 5.20 | 200 | HTML 5, English |
6385 | sede.seg-social.gob.es | 7489 | 5.20 | 200 | HTML 5, English |
6386 | flsenate.gov | 7491 | 5.20 | 200 | HTML 5, No Lang |
6387 | courant.com | 7493 | 5.20 | 200 | HTML 5, English |
6388 | activerain.com | 7494 | 5.20 | 200 | HTML 5, No Lang |
6389 | mlssoccer.com | 7495 | 5.20 | 200 | HTML 5, English |
6390 | aftonbladet.se | 7496 | 5.20 | 200 | HTML 5 |
6391 | aspca.org | 7497 | 5.20 | 200 | HTML 5, English |
6392 | steelcase.com | 7498 | 5.20 | 200 | HTML 5, No Lang |
6393 | odmp.org | 7499 | 5.20 | 200 | No Lang, Transitional |
6394 | form.typeform.com | 7500 | 5.20 | 200 | HTML 5, English |
6395 | grants.gov | 7501 | 5.20 | 200 | HTML 5, English |
6396 | hackerrank.com | 7502 | 5.20 | 200 | HTML 5, English |
6397 | prlog.org | 7504 | 5.20 | 200 | HTML 5, No Lang |
6398 | skift.com | 7505 | 5.20 | 200 | HTML 5, English |
6399 | support.avg.com | 7506 | 5.20 | 200 | No Lang, Transitional |
6400 | ops.fhwa.dot.gov | 7507 | 5.20 | 200 | English, Transitional |
Data from: Open PageRank