Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
10501 | gm.com | 12246 | 5.03 | 200 | English, Strict |
10502 | kakao.com | 12247 | 5.03 | 200 | HTML 5 |
10503 | setn.com | 12248 | 5.03 | 200 | Transitional |
10504 | angular.dev | 12249 | 5.03 | 200 | HTML 5, English |
10505 | hackread.com | 12250 | 5.03 | 200 | HTML 5, English |
10506 | lafriche.org | 12251 | 5.03 | 200 | HTML 5 |
10507 | poeditor.com | 12252 | 5.03 | 200 | HTML 5, English |
10508 | japantoday.com | 12253 | 5.03 | 200 | HTML 5, English |
10509 | mdcalc.com | 12254 | 5.03 | 200 | HTML 5, English |
10510 | energycommerce.house.gov | 12255 | 5.03 | 200 | HTML 5, English |
10511 | yslow.org | 12256 | 5.03 | 200 | HTML 5, No Lang |
10512 | radiomuseum.org | 12257 | 5.03 | 200 | English |
10513 | tampere.fi | 12258 | 5.03 | 200 | HTML 5 |
10514 | tel.meet | 12259 | 5.03 | 200 | HTML 5, English |
10515 | mention.com | 12260 | 5.03 | 200 | HTML 5, English |
10516 | konserthuset.se | 12261 | 5.03 | 200 | HTML 5 |
10517 | kempinski.com | 12262 | 5.03 | 200 | HTML 5, English |
10518 | linkprotect.cudasvc.com | 12263 | 5.03 | 200 | No Lang |
10519 | coned.com | 12264 | 5.03 | 200 | HTML 5, English |
10520 | data.gov.cy | 12266 | 5.03 | 200 | HTML 5 |
10521 | health.google | 12267 | 5.03 | 200 | HTML 5, English |
10522 | selloship.com | 12268 | 5.03 | 200 | HTML 5, English |
10523 | futura-sciences.com | 12270 | 5.03 | 200 | HTML 5 |
10524 | totalenergies.com | 12271 | 5.03 | 200 | HTML 5, English |
10525 | ecfr.eu | 12272 | 5.03 | 200 | HTML 5, English |
10526 | blog.ycombinator.com | 12273 | 5.03 | 200 | HTML 5, English |
10527 | discoveryplus.com | 12274 | 5.03 | 200 | HTML 5, English |
10528 | darktrace.com | 12275 | 5.03 | 200 | HTML 5, English |
10529 | taskade.com | 12276 | 5.03 | 200 | HTML 5, English |
10530 | ihmc.us | 12278 | 5.03 | 200 | HTML 5, No Lang |
10531 | hemmings.com | 12279 | 5.03 | 200 | No Lang |
10532 | atmarkit.co.jp | 12281 | 5.03 | 200 | Transitional |
10533 | govdata.de | 12282 | 5.03 | 200 | HTML 5 |
10534 | founders.archives.gov | 12283 | 5.03 | 200 | HTML 5, English |
10535 | lifehacker.ru | 12284 | 5.03 | 200 | HTML 5 |
10536 | philarchive.org | 12286 | 5.03 | 200 | No Lang, Strict |
10537 | sweepwidget.com | 12287 | 5.03 | 200 | HTML 5, No Lang |
10538 | behr.com | 12288 | 5.03 | 200 | HTML 5, English |
10539 | advertising.amazon.com | 12289 | 5.03 | 200 | HTML 5, English |
10540 | prosopo.io | 12291 | 5.03 | 200 | HTML 5, English |
10541 | weatherbug.com | 12292 | 5.03 | 200 | HTML 5, No Lang |
10542 | bugs.debian.org | 12293 | 5.03 | 200 | English, Strict |
10543 | sankei.com | 12294 | 5.03 | 200 | HTML 5 |
10544 | college.columbia.edu | 12295 | 5.03 | 200 | HTML 5, No Lang |
10545 | futurity.org | 12296 | 5.03 | 200 | HTML 5, English |
10546 | cph.org | 12297 | 5.03 | 200 | HTML 5, English |
10547 | thesprucecrafts.com | 12298 | 5.03 | 200 | HTML 5, No Lang |
10548 | twice.com | 12299 | 5.03 | 200 | HTML 5, English |
10549 | codereview.chromium.org | 12300 | 5.03 | 200 | HTML 5, No Lang |
10550 | intelligentcio.com | 12301 | 5.03 | 200 | HTML 5, English |
10551 | oversight.house.gov | 12302 | 5.03 | 200 | English |
10552 | newyorkyimby.com | 12303 | 5.03 | 200 | HTML 5, English |
10553 | industry.gov.au | 12304 | 5.03 | 200 | HTML 5, English |
10554 | uwo.ca | 12305 | 5.03 | 200 | HTML 5, English |
10555 | voicebot.ai | 12306 | 5.03 | 200 | HTML 5, English |
10556 | pcrm.org | 12307 | 5.03 | 200 | HTML 5, English |
10557 | ncatlab.org | 12309 | 5.03 | 200 | No Lang |
10558 | embedded.com | 12310 | 5.03 | 200 | HTML 5, English |
10559 | en.wordpress.com | 12311 | 5.03 | 200 | HTML 5, English |
10560 | decentraland.org | 12312 | 5.03 | 200 | HTML 5, English |
10561 | nch.com.au | 12313 | 5.03 | 200 | English |
10562 | tutor.com | 12314 | 5.03 | 200 | HTML 5, English |
10563 | aguascalientes.gob.mx | 12315 | 5.03 | 200 | No Lang |
10564 | metricool.com | 12316 | 5.03 | 200 | HTML 5, English |
10565 | centreforaviation.com | 12317 | 5.03 | 200 | HTML 5, English |
10566 | mediaklikk.hu | 12318 | 5.03 | 200 | HTML 5 |
10567 | zap.co.il | 12319 | 5.03 | 200 | HTML 5 |
10568 | cyberlaw.stanford.edu | 12320 | 5.03 | 200 | HTML 5, English |
10569 | moztw.org | 12321 | 5.03 | 200 | HTML 5 |
10570 | esta.cbp.dhs.gov | 12323 | 5.03 | 200 | HTML 5, English |
10571 | abb.com | 12324 | 5.03 | 200 | HTML 5, English |
10572 | enjoythemusic.com | 12325 | 5.03 | 200 | No Lang |
10573 | melscience.com | 12326 | 5.03 | 200 | HTML 5, English |
10574 | coreos.com | 12327 | 5.03 | 200 | HTML 5, English |
10575 | comicbook.com | 12328 | 5.03 | 200 | HTML 5, English |
10576 | data.gouv.fr | 12329 | 5.03 | 200 | HTML 5 |
10577 | covid-19.ontario.ca | 12330 | 5.03 | 200 | HTML 5, English |
10578 | newmediarights.org | 12331 | 5.03 | 200 | HTML 5, English |
10579 | aopa.org | 12332 | 5.03 | 200 | HTML 5, English |
10580 | home.openweathermap.org | 12333 | 5.03 | 200 | HTML 5, English |
10581 | hakaimagazine.com | 12334 | 5.03 | 200 | HTML 5, English |
10582 | backpacker.com | 12336 | 5.03 | 200 | HTML 5, English |
10583 | stat.berkeley.edu | 12337 | 5.03 | 200 | HTML 5, English |
10584 | sourcingjournal.com | 12338 | 5.03 | 200 | HTML 5, English |
10585 | parenting.com | 12339 | 5.03 | 200 | HTML 5, English |
10586 | instabio.cc | 12340 | 5.03 | 200 | HTML 5, English |
10587 | psychiatrictimes.com | 12341 | 5.03 | 200 | HTML 5, English |
10588 | devblogs.nvidia.com | 12342 | 5.03 | 200 | HTML 5, English |
10589 | trybooking.com | 12343 | 5.03 | 200 | English |
10590 | jetstar.com | 12344 | 5.03 | 200 | HTML 5, English |
10591 | mapple.co.jp | 12345 | 5.03 | 200 | HTML 5 |
10592 | archive.stsci.edu | 12346 | 5.03 | 200 | HTML 5, English |
10593 | premierguitar.com | 12347 | 5.03 | 200 | HTML 5, English |
10594 | thewritepractice.com | 12348 | 5.03 | 200 | HTML 5, English |
10595 | ghr.nlm.nih.gov | 12350 | 5.03 | 200 | HTML 5, English |
10596 | piszek.com | 12351 | 5.03 | 200 | HTML 5, English |
10597 | rytr.me | 12352 | 5.03 | 200 | HTML 5, English |
10598 | elinux.org | 12353 | 5.03 | 200 | HTML 5, English |
10599 | tech.slashdot.org | 12354 | 5.03 | 200 | English |
10600 | netology.ru | 12355 | 5.03 | 200 | HTML 5 |
Data from: Open PageRank