Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
16601 | capitaloneshopping.com | 19359 | 4.90 | 200 | HTML 5, English |
16602 | tech.eu | 19360 | 4.90 | 200 | HTML 5, English |
16603 | architecture.mit.edu | 19361 | 4.90 | 200 | HTML 5, English |
16604 | world.taobao.com | 19362 | 4.90 | 200 | HTML 5 |
16605 | air.org | 19363 | 4.90 | 200 | HTML 5, English |
16606 | corp.toei-anim.co.jp | 19364 | 4.90 | 200 | HTML 5 |
16607 | fix.com | 19365 | 4.90 | 200 | HTML 5, English |
16608 | bojangles.com | 19366 | 4.90 | 200 | HTML 5, English |
16609 | flexibits.com | 19367 | 4.90 | 200 | HTML 5, English |
16610 | skydrive.live.com | 19368 | 4.90 | 200 | HTML 5, English |
16611 | vfw.org | 19369 | 4.90 | 200 | HTML 5, English |
16612 | now.tufts.edu | 19370 | 4.90 | 200 | HTML 5, English |
16613 | babelnet.org | 19371 | 4.90 | 200 | HTML 5, English |
16614 | dreadcentral.com | 19372 | 4.90 | 200 | HTML 5, English |
16615 | monumentum.fr | 19373 | 4.90 | 200 | HTML 5 |
16616 | superrare.com | 19374 | 4.90 | 200 | HTML 5, English |
16617 | turnoffthelights.com | 19375 | 4.90 | 200 | English |
16618 | silabs.com | 19376 | 4.90 | 200 | HTML 5, English |
16619 | en.mapy.cz | 19377 | 4.90 | 200 | HTML 5, No Lang |
16620 | inf.fu-berlin.de | 19378 | 4.90 | 200 | HTML 5 |
16621 | avrotros.nl | 19379 | 4.90 | 200 | HTML 5 |
16622 | igg.com | 19380 | 4.90 | 200 | HTML 5, English |
16623 | roomtoread.org | 19381 | 4.90 | 200 | HTML 5, English |
16624 | psyche.co | 19382 | 4.90 | 200 | HTML 5, English |
16625 | bismarcktribune.com | 19383 | 4.90 | 200 | HTML 5, English |
16626 | paylocity.com | 19384 | 4.90 | 200 | HTML 5, English |
16627 | feinberg.northwestern.edu | 19385 | 4.90 | 200 | HTML 5, English |
16628 | rpubs.com | 19386 | 4.90 | 200 | HTML 5, English |
16629 | spotahome.com | 19387 | 4.90 | 200 | HTML 5, English |
16630 | paloaltoonline.com | 19388 | 4.90 | 200 | HTML 5, English |
16631 | uniden.com.au | 19389 | 4.90 | 200 | HTML 5, English |
16632 | build.prestashop-project.org | 19390 | 4.90 | 200 | HTML 5, English |
16633 | fortanix.com | 19391 | 4.90 | 200 | HTML 5, English |
16634 | libraryjournal.com | 19392 | 4.90 | 200 | No Lang, Transitional |
16635 | review.chicagobooth.edu | 19393 | 4.90 | 200 | HTML 5, English |
16636 | northumbria.ac.uk | 19394 | 4.90 | 200 | HTML 5, English |
16637 | hdfgroup.org | 19395 | 4.90 | 200 | HTML 5, English |
16638 | thesundaytimes.co.uk | 19396 | 4.90 | 200 | HTML 5, English |
16639 | vitejs.dev | 19397 | 4.90 | 200 | HTML 5, English |
16640 | evilmadscientist.com | 19398 | 4.90 | 200 | HTML 5, English |
16641 | snob.ru | 19399 | 4.90 | 200 | HTML 5 |
16642 | login.yahoo.com | 19400 | 4.90 | 200 | HTML 5, English |
16643 | cvut.cz | 19401 | 4.90 | 200 | HTML 5 |
16644 | austlii.edu.au | 19402 | 4.90 | 200 | HTML 5, English |
16645 | enstinemuki.com | 19404 | 4.90 | 200 | HTML 5, English |
16646 | jobs.wordpress.net | 19405 | 4.90 | 200 | HTML 5, English |
16647 | crosswire.org | 19407 | 4.90 | 200 | English, Strict |
16648 | ecy.wa.gov | 19408 | 4.90 | 200 | HTML 5, English |
16649 | eurasiareview.com | 19409 | 4.90 | 200 | HTML 5, English |
16650 | bluino.com | 19410 | 4.90 | 200 | HTML 5, No Lang |
16651 | infusionsoft.com | 19411 | 4.90 | 200 | HTML 5, English |
16652 | brethren.org | 19413 | 4.90 | 200 | HTML 5, English |
16653 | perkinscoie.com | 19414 | 4.90 | 200 | HTML 5, English |
16654 | bni.com | 19415 | 4.90 | 200 | HTML 5, English |
16655 | arretsurimages.net | 19416 | 4.90 | 200 | HTML 5, No Lang |
16656 | status.openai.com | 19417 | 4.90 | 200 | HTML 5, English |
16657 | blog.ucsusa.org | 19419 | 4.90 | 200 | HTML 5, English |
16658 | brahmakumaris.org | 19420 | 4.90 | 200 | HTML 5, English |
16659 | zona.media | 19422 | 4.90 | 200 | HTML 5 |
16660 | realtor.ca | 19423 | 4.90 | 200 | No Lang |
16661 | wam.ae | 19424 | 4.90 | 200 | HTML 5, English |
16662 | sibcolombia.net | 19425 | 4.90 | 200 | HTML 5 |
16663 | nodle.io | 19426 | 4.90 | 200 | HTML 5, English |
16664 | bodc.ac.uk | 19427 | 4.90 | 200 | HTML 5, No Lang |
16665 | dhm.de | 19428 | 4.90 | 200 | HTML 5 |
16666 | az-theme.net | 19430 | 4.90 | 200 | HTML 5, English |
16667 | bouboulis.mysch.gr | 19432 | 4.90 | 200 | English |
16668 | fusejs.io | 19433 | 4.90 | 200 | HTML 5, English |
16669 | eorthopod.com | 19434 | 4.90 | 200 | HTML 5, English |
16670 | muun.com | 19435 | 4.90 | 200 | HTML 5, No Lang |
16671 | gridinsoft.com | 19436 | 4.90 | 200 | HTML 5, English |
16672 | lifeasahuman.com | 19437 | 4.90 | 200 | HTML 5, English |
16673 | hilltimes.com | 19438 | 4.90 | 200 | HTML 5, English |
16674 | oaklandlibrary.org | 19439 | 4.90 | 200 | HTML 5, English |
16675 | marssociety.org | 19440 | 4.90 | 200 | HTML 5, English |
16676 | paulund.co.uk | 19441 | 4.90 | 200 | HTML 5, English |
16677 | memory-alpha.fandom.com | 19442 | 4.90 | 200 | HTML 5, English |
16678 | yiiframework.com | 19443 | 4.90 | 200 | HTML 5, English |
16679 | ocbc.com | 19444 | 4.90 | 200 | HTML 5, No Lang |
16680 | tablepress.org | 19445 | 4.90 | 200 | HTML 5, English |
16681 | spinupwp.com | 19446 | 4.90 | 200 | HTML 5, English |
16682 | payu.in | 19447 | 4.90 | 200 | HTML 5, English |
16683 | cnes.fr | 19448 | 4.90 | 200 | HTML 5 |
16684 | neteller.com | 19449 | 4.90 | 200 | HTML 5, English |
16685 | ksltv.com | 19450 | 4.90 | 200 | HTML 5, English |
16686 | st-helens.org.uk | 19452 | 4.90 | 200 | HTML 5, English |
16687 | codedread.com | 19453 | 4.90 | 200 | HTML 5, English |
16688 | readability.com | 19454 | 4.90 | 200 | HTML 5, English |
16689 | pente.org | 19455 | 4.90 | 200 | English, Transitional |
16690 | thisislondon.co.uk | 19457 | 4.90 | 200 | HTML 5, English |
16691 | sejda.com | 19458 | 4.90 | 200 | HTML 5, English |
16692 | code.fb.com | 19459 | 4.90 | 200 | HTML 5, English |
16693 | portugal.gov.pt | 19460 | 4.90 | 200 | HTML 5 |
16694 | cbu.uz | 19461 | 4.90 | 200 | HTML 5 |
16695 | pcwelt.de | 19462 | 4.90 | 200 | HTML 5 |
16696 | u-paris.fr | 19464 | 4.90 | 200 | HTML 5 |
16697 | tutorial.math.lamar.edu | 19465 | 4.90 | 200 | HTML 5, No Lang |
16698 | paysend.com | 19466 | 4.90 | 200 | HTML 5, English |
16699 | ilr.cornell.edu | 19467 | 4.90 | 200 | HTML 5, English |
16700 | pret.co.uk | 19468 | 4.90 | 200 | HTML 5, English |
Data from: Open PageRank