Table filled in from the top 10M most popular websites.
- List Quality
- This list is prioritized by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Doesn't get much traffic, but not even in the top 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix/park to handle redirects.
- Lost revenue, given how much linking it takes to get on this list
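
The crawl checks in the list above can be sketched roughly as follows. This is a minimal sketch, not the crawler actually used for this table: the function names (`candidate_urls`, `classify_html`, `fetch_home_page`) are hypothetical, and the mapping to flags such as "HTML 5", "Strict", "No Lang" and "English" is only a guess based on the Flags column below. It uses the Python standard library and keeps certificate verification on, so a naked domain with an invalid certificate shows up as an error instead of a redirect.

```python
import re
import ssl
import urllib.error
import urllib.request

# Assumption: the doctype and <html> tag appear near the top of the page.
DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
LANG_RE = re.compile(r"<html\b[^>]*?\slang\s*=\s*[\"']?([A-Za-z-]+)", re.IGNORECASE)


def candidate_urls(domain):
    """Return the naked-domain URL first, then its www. variant."""
    hosts = [domain]
    if not domain.startswith("www."):
        hosts.append("www." + domain)
    return ["https://" + host + "/" for host in hosts]


def classify_html(html):
    """Return guessed flags for the doctype and the html lang attribute."""
    flags = []
    head = html[:4096]
    doctype = DOCTYPE_RE.search(head)
    if doctype:
        decl = doctype.group(1).strip().lower()
        if decl == "html":
            flags.append("HTML 5")
        elif "strict" in decl:
            flags.append("Strict")
        elif "transitional" in decl:
            flags.append("Transitional")
    lang = LANG_RE.search(head)
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().startswith("en"):
        flags.append("English")
    return flags


def fetch_home_page(url, timeout=10):
    """Fetch url with standard security checks; return (status, html, error)."""
    context = ssl.create_default_context()  # certificate and hostname checks stay on
    try:
        with urllib.request.urlopen(url, timeout=timeout, context=context) as resp:
            body = resp.read(65536).decode("utf-8", errors="replace")
            return resp.status, body, None
    except urllib.error.HTTPError as exc:
        return exc.code, "", None
    except urllib.error.URLError as exc:
        # Certificate failures arrive wrapped in URLError; DNS failures do too.
        kind = "ssl" if isinstance(exc.reason, ssl.SSLError) else "connect"
        return None, "", kind
    except OSError:
        return None, "", "connect"


def crawl_domain(domain):
    """Try the naked domain, then www.; return the first successful result."""
    last_error = None
    for url in candidate_urls(domain):
        status, html, error = fetch_home_page(url)
        if status == 200:
            return url, status, classify_html(html)
        last_error = error or status
    return None, last_error, []
```

Trying the naked domain first matches how the list presents domains. A client that keeps verification on only reaches the `www.` variant because this sketch tries it explicitly, never by following a redirect from a host with a bad certificate.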
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18601 | stylebyemilyhenderson.com | 21675 | 4.87 | 200 | HTML 5, English |
18602 | jcdecaux.com | 21676 | 4.87 | 200 | HTML 5, English |
18603 | columbiadoctors.org | 21677 | 4.87 | 200 | HTML 5, English |
18604 | m.wikihow.com | 21678 | 4.87 | 200 | HTML 5, English |
18605 | knsb.nl | 21679 | 4.87 | 200 | HTML 5 |
18606 | opednews.com | 21680 | 4.87 | 200 | HTML 5, English |
18607 | blog.joinmastodon.org | 21681 | 4.87 | 200 | HTML 5, No Lang |
18608 | nga.org | 21682 | 4.87 | 200 | HTML 5, No Lang |
18609 | excellentwebworld.com | 21683 | 4.87 | 200 | HTML 5, English |
18610 | ddj.com | 21684 | 4.87 | 200 | HTML 5, No Lang |
18611 | datocms.com | 21685 | 4.87 | 200 | HTML 5, English |
18612 | webpushr.com | 21686 | 4.87 | 200 | HTML 5, English |
18613 | nagios.com | 21687 | 4.87 | 200 | HTML 5, English |
18614 | jerseymikes.com | 21688 | 4.87 | 200 | HTML 5, English |
18615 | docs.lib.purdue.edu | 21689 | 4.87 | 200 | HTML 5, English |
18616 | lod-cloud.net | 21690 | 4.87 | 200 | HTML 5, English |
18617 | www2.archivists.org | 21692 | 4.87 | 200 | English, Strict |
18618 | pickuplimes.com | 21693 | 4.87 | 200 | HTML 5, English |
18619 | huffduffer.com | 21694 | 4.87 | 200 | HTML 5, English |
18620 | msn.foxsports.com | 21695 | 4.87 | 200 | HTML 5, English |
18621 | modthesims.info | 21696 | 4.87 | 200 | HTML 5, No Lang |
18622 | donauregion.at | 21697 | 4.87 | 200 | HTML 5 |
18623 | invisible-movement.net | 21698 | 4.87 | 200 | HTML 5, English |
18624 | alko.fi | 21699 | 4.87 | 200 | No Lang |
18625 | getsentry.com | 21700 | 4.87 | 200 | HTML 5, English |
18626 | ichkoche.at | 21701 | 4.87 | 200 | HTML 5 |
18627 | failbettergames.com | 21702 | 4.87 | 200 | HTML 5, English |
18628 | buro247.ru | 21703 | 4.87 | 200 | HTML 5 |
18629 | iihf.com | 21704 | 4.87 | 200 | HTML 5, English |
18630 | store.webkul.com | 21705 | 4.87 | 200 | English, Strict |
18631 | secureworks.com | 21706 | 4.87 | 200 | HTML 5, English |
18632 | greenpeace.de | 21707 | 4.87 | 200 | HTML 5 |
18633 | swe.org | 21708 | 4.87 | 200 | HTML 5, English |
18634 | calculator.io | 21709 | 4.87 | 200 | HTML 5, English |
18635 | archaeology.org | 21710 | 4.87 | 200 | HTML 5, English |
18636 | cincodias.elpais.com | 21711 | 4.87 | 200 | HTML 5 |
18637 | golfchannel.com | 21712 | 4.87 | 200 | HTML 5, English |
18638 | hurricane.de | 21713 | 4.87 | 200 | HTML 5, English |
18639 | sunnah.com | 21714 | 4.87 | 200 | No Lang, Strict |
18640 | soaphub.com | 21715 | 4.87 | 200 | HTML 5, English |
18641 | msdh.ms.gov | 21716 | 4.87 | 200 | HTML 5, No Lang |
18642 | jvns.ca | 21718 | 4.87 | 200 | HTML 5, English |
18643 | red-sweater.com | 21719 | 4.87 | 200 | HTML 5, English |
18644 | jwa.org | 21720 | 4.87 | 200 | HTML 5, English |
18645 | khl.com | 21721 | 4.87 | 200 | HTML 5, English |
18646 | mpra.ub.uni-muenchen.de | 21722 | 4.87 | 200 | HTML 5, No Lang |
18647 | news.developer.nvidia.com | 21724 | 4.87 | 200 | HTML 5, English |
18648 | ridebustang.com | 21725 | 4.87 | 200 | HTML 5, English |
18649 | voices.nationalgeographic.com | 21726 | 4.87 | 200 | HTML 5, No Lang |
18650 | coubic.com | 21728 | 4.87 | 200 | HTML 5 |
18651 | cio.co.uk | 21729 | 4.87 | 200 | HTML 5, English |
18652 | goodereader.com | 21730 | 4.87 | 200 | HTML 5, English |
18653 | armis.com | 21731 | 4.87 | 200 | HTML 5, English |
18654 | cardcow.com | 21733 | 4.87 | 200 | HTML 5, English |
18655 | hihonor.com | 21734 | 4.87 | 200 | HTML 5, English |
18656 | umu.se | 21735 | 4.87 | 200 | HTML 5 |
18657 | scholar.google.com.br | 21736 | 4.87 | 200 | HTML 5, No Lang |
18658 | wanderlog.com | 21737 | 4.87 | 200 | HTML 5, No Lang |
18659 | mrwweb.com | 21738 | 4.87 | 200 | No Lang |
18660 | ankiweb.net | 21739 | 4.87 | 200 | HTML 5, English |
18661 | support.esri.com | 21740 | 4.87 | 200 | HTML 5, English |
18662 | emigre.com | 21741 | 4.87 | 200 | HTML 5, English |
18663 | thevintagenews.com | 21742 | 4.87 | 200 | HTML 5, English |
18664 | organic-chemistry.org | 21743 | 4.87 | 200 | No Lang |
18665 | brainscape.com | 21744 | 4.87 | 200 | HTML 5, English |
18666 | digitalia.be | 21745 | 4.87 | 200 | HTML 5, No Lang |
18667 | kalundborg.dk | 21746 | 4.87 | 200 | HTML 5 |
18668 | nms.ac.uk | 21747 | 4.87 | 200 | HTML 5, English |
18669 | slant.co | 21748 | 4.87 | 200 | HTML 5, English |
18670 | today.line.me | 21749 | 4.87 | 200 | HTML 5 |
18671 | mondoweiss.net | 21750 | 4.87 | 200 | HTML 5, English |
18672 | council.science | 21751 | 4.87 | 200 | HTML 5, English |
18673 | encolombia.com | 21752 | 4.87 | 200 | |
18674 | covenanteyes.com | 21753 | 4.87 | 200 | HTML 5, English |
18675 | riptapparel.com | 21755 | 4.87 | 200 | HTML 5, English |
18676 | sony.co.jp | 21756 | 4.87 | 200 | HTML 5 |
18677 | arnebrachhold.de | 21757 | 4.87 | 200 | HTML 5, English |
18678 | worldremit.com | 21759 | 4.87 | 200 | HTML 5, English |
18679 | megalodon.jp | 21760 | 4.87 | 200 | HTML 5 |
18680 | fondoambiente.it | 21761 | 4.87 | 200 | HTML 5 |
18681 | geeks3d.com | 21762 | 4.87 | 200 | HTML 5, English |
18682 | ilgazzettino.it | 21763 | 4.87 | 200 | HTML 5 |
18683 | css3.info | 21764 | 4.87 | 200 | English, Strict |
18684 | epod.usra.edu | 21765 | 4.87 | 200 | No Lang, Transitional |
18685 | eda.admin.ch | 21766 | 4.87 | 200 | HTML 5 |
18686 | vgwort.de | 21767 | 4.87 | 200 | HTML 5 |
18687 | lkml.iu.edu | 21768 | 4.87 | 200 | HTML 5, No Lang |
18688 | ec.toranoana.jp | 21769 | 4.87 | 200 | HTML 5 |
18689 | telecom.economictimes.indiatimes.com | 21770 | 4.87 | 200 | HTML 5, English |
18690 | neteasegames.com | 21771 | 4.87 | 200 | HTML 5, No Lang |
18691 | magersandquinn.com | 21772 | 4.87 | 200 | No Lang |
18692 | tvprofil.com | 21773 | 4.87 | 200 | HTML 5 |
18693 | privatbank.ua | 21774 | 4.87 | 200 | HTML 5 |
18694 | pomona.edu | 21775 | 4.87 | 200 | HTML 5, English |
18695 | torontomu.ca | 21776 | 4.87 | 200 | HTML 5, English |
18696 | boozt.com | 21780 | 4.87 | 200 | HTML 5, English |
18697 | apprendre.tv5monde.com | 21781 | 4.87 | 200 | HTML 5 |
18698 | efe.cl | 21782 | 4.87 | 200 | HTML 5 |
18699 | hearst.com | 21783 | 4.87 | 200 | HTML 5, English |
18700 | democrats.senate.gov | 21785 | 4.87 | 200 | HTML 5, English |
Data from: Open PageRank