Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
12501 | aha.io | 14590 | 4.98 | 200 | HTML 5, No Lang |
12502 | people.seas.harvard.edu | 14591 | 4.98 | 200 | HTML 5, English |
12503 | edge.org | 14592 | 4.98 | 200 | English |
12504 | 247sports.com | 14593 | 4.98 | 200 | HTML 5, English |
12505 | geekdashboard.com | 14596 | 4.98 | 200 | HTML 5, English |
12506 | scipy.org | 14598 | 4.98 | 200 | HTML 5, English |
12507 | rtlplay.be | 14599 | 4.98 | 200 | HTML 5 |
12508 | ora.ox.ac.uk | 14600 | 4.98 | 200 | HTML 5, English |
12509 | endangered.org | 14601 | 4.98 | 200 | HTML 5, English |
12510 | officesnapshots.com | 14602 | 4.98 | 200 | No Lang, Transitional |
12511 | doctissimo.fr | 14605 | 4.98 | 200 | HTML 5 |
12512 | classes.bnf.fr | 14607 | 4.98 | 200 | |
12513 | guardian.co.tt | 14608 | 4.98 | 200 | HTML 5, English |
12514 | insurancebusinessmag.com | 14609 | 4.98 | 200 | HTML 5, English |
12515 | community.esri.com | 14610 | 4.98 | 200 | HTML 5, English |
12516 | csudh.edu | 14611 | 4.98 | 200 | HTML 5, English |
12517 | vodafone.cz | 14612 | 4.98 | 200 | HTML 5 |
12518 | bimigroup.org | 14613 | 4.98 | 200 | HTML 5, English |
12519 | google.lk | 14614 | 4.98 | 200 | HTML 5, English |
12520 | thelocal.fr | 14615 | 4.98 | 200 | HTML 5, English |
12521 | buff.ly | 14616 | 4.98 | 200 | HTML 5, English |
12522 | worldcoin.org | 14617 | 4.98 | 200 | HTML 5, English |
12523 | saylor.org | 14618 | 4.98 | 200 | HTML 5, English |
12524 | site.uottawa.ca | 14619 | 4.98 | 200 | HTML 5, English |
12525 | onlinephp.io | 14621 | 4.98 | 200 | HTML 5, English |
12526 | igg.me | 14622 | 4.98 | 200 | HTML 5, No Lang |
12527 | kodansha.co.jp | 14623 | 4.98 | 200 | No Lang, Transitional |
12528 | uploadcare.com | 14624 | 4.98 | 200 | HTML 5, English |
12529 | marieclaire.co.uk | 14625 | 4.98 | 200 | HTML 5, English |
12530 | elemental.medium.com | 14626 | 4.98 | 200 | HTML 5, English |
12531 | thetrace.org | 14628 | 4.98 | 200 | HTML 5, English |
12532 | cleantalk.org | 14629 | 4.98 | 200 | HTML 5, English |
12533 | offshoreleaks.icij.org | 14630 | 4.98 | 200 | HTML 5, No Lang |
12534 | try.crashlytics.com | 14631 | 4.98 | 200 | HTML 5, English |
12535 | sl.wikipedia.org | 14632 | 4.98 | 200 | HTML 5, No Lang |
12536 | ansi.org | 14633 | 4.98 | 200 | HTML 5, English |
12537 | raeng.org.uk | 14634 | 4.98 | 200 | HTML 5, English |
12538 | intervalworld.com | 14636 | 4.98 | 200 | HTML 5, English |
12539 | getmonero.org | 14637 | 4.98 | 200 | HTML 5, English |
12540 | math.uchicago.edu | 14638 | 4.98 | 200 | HTML 5, English |
12541 | ipwatchdog.com | 14639 | 4.98 | 200 | HTML 5, English |
12542 | mzl.la | 14640 | 4.98 | 200 | HTML 5, English |
12543 | changelog.com | 14641 | 4.98 | 200 | HTML 5, English |
12544 | pdfdrive.com | 14642 | 4.98 | 200 | No Lang, Transitional |
12545 | boxesandarrows.com | 14643 | 4.98 | 200 | HTML 5, English |
12546 | timetoast.com | 14645 | 4.98 | 200 | HTML 5, English |
12547 | thawte.com | 14646 | 4.98 | 200 | HTML 5, English |
12548 | biblio.ugent.be | 14647 | 4.98 | 200 | HTML 5 |
12549 | indiatimes.com | 14649 | 4.98 | 200 | HTML 5, English |
12550 | kennelliitto.fi | 14650 | 4.98 | 200 | HTML 5 |
12551 | ncaa.org | 14651 | 4.98 | 200 | HTML 5, English |
12552 | universeodon.com | 14652 | 4.98 | 200 | HTML 5, English |
12553 | internetnews.com | 14653 | 4.98 | 200 | English |
12554 | steadfastlutherans.org | 14654 | 4.98 | 200 | HTML 5, English |
12555 | andrealazzarotto.com | 14655 | 4.98 | 200 | HTML 5 |
12556 | newsok.com | 14656 | 4.98 | 200 | HTML 5, English |
12557 | helda.helsinki.fi | 14657 | 4.98 | 200 | HTML 5, No Lang |
12558 | alislam.org | 14658 | 4.98 | 200 | HTML 5, English |
12559 | hadoop.apache.org | 14659 | 4.98 | 200 | HTML 5, English |
12560 | porta.de | 14661 | 4.98 | 200 | HTML 5 |
12561 | foxweather.com | 14662 | 4.98 | 200 | HTML 5, English |
12562 | careers.microsoft.com | 14663 | 4.98 | 200 | HTML 5, English |
12563 | pratham.org | 14665 | 4.98 | 200 | HTML 5, English |
12564 | ischool.berkeley.edu | 14666 | 4.98 | 200 | HTML 5, English |
12565 | thenewinquiry.com | 14668 | 4.98 | 200 | HTML 5, English |
12566 | blog.virustotal.com | 14669 | 4.98 | 200 | HTML 5, No Lang |
12567 | cutimes.com | 14670 | 4.98 | 200 | HTML 5, English |
12568 | challenges.fr | 14671 | 4.98 | 200 | HTML 5 |
12569 | bookcrossing.com | 14672 | 4.98 | 200 | HTML 5, No Lang |
12570 | exploringjs.com | 14673 | 4.98 | 200 | HTML 5, No Lang |
12571 | wonder.cdc.gov | 14674 | 4.98 | 200 | HTML 5, English |
12572 | americanart.si.edu | 14675 | 4.98 | 200 | HTML 5, English |
12573 | epoca.globo.com | 14676 | 4.98 | 200 | HTML 5 |
12574 | freshome.com | 14677 | 4.98 | 200 | HTML 5, English |
12575 | mom.gov.sg | 14678 | 4.98 | 200 | HTML 5, English |
12576 | breaker.audio | 14679 | 4.98 | 200 | HTML 5, English |
12577 | tobiasahlin.com | 14680 | 4.98 | 200 | HTML 5, English |
12578 | bocoup.com | 14682 | 4.98 | 200 | HTML 5, English |
12579 | mcny.org | 14683 | 4.98 | 200 | HTML 5, English |
12580 | rocketreach.co | 14684 | 4.98 | 200 | HTML 5, English |
12581 | scummvm.org | 14685 | 4.98 | 200 | HTML 5, English |
12582 | recruiter.com | 14686 | 4.98 | 200 | HTML 5, English |
12583 | glami.cz | 14687 | 4.98 | 200 | HTML 5 |
12584 | legislature.mi.gov | 14688 | 4.98 | 200 | HTML 5, English |
12585 | thechive.com | 14689 | 4.98 | 200 | HTML 5, English |
12586 | everymac.com | 14690 | 4.98 | 200 | No Lang, Transitional |
12587 | eatright.org | 14691 | 4.98 | 200 | HTML 5, English |
12588 | mastodonapp.uk | 14692 | 4.98 | 200 | HTML 5, English |
12589 | marxist.com | 14693 | 4.98 | 200 | HTML 5, English |
12590 | ria.ee | 14694 | 4.98 | 200 | HTML 5 |
12591 | tc39.github.io | 14695 | 4.98 | 200 | HTML 5, English |
12592 | kleinezeitung.at | 14696 | 4.98 | 200 | HTML 5 |
12593 | upr.edu | 14697 | 4.98 | 200 | No Lang |
12594 | nostarch.com | 14698 | 4.98 | 200 | HTML 5, English |
12595 | wham-o.com | 14699 | 4.98 | 200 | HTML 5, English |
12596 | homestratosphere.com | 14700 | 4.98 | 200 | HTML 5, English |
12597 | cureus.com | 14701 | 4.98 | 200 | HTML 5, No Lang |
12598 | melbournewater.com.au | 14703 | 4.98 | 200 | HTML 5, English |
12599 | auckland.ac.nz | 14704 | 4.98 | 200 | HTML 5, English |
12600 | libertaddigital.com | 14705 | 4.98 | 200 | HTML 5 |
Data from: Open PageRank