Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
12301 | batchgeo.com | 14359 | 4.99 | 200 | HTML 5, English |
12302 | americasquarterly.org | 14360 | 4.99 | 200 | HTML 5, English |
12303 | welcometothejungle.com | 14362 | 4.99 | 200 | HTML 5, English |
12304 | adc.bmj.com | 14363 | 4.99 | 200 | HTML 5, English |
12305 | transmissionbt.com | 14364 | 4.99 | 200 | HTML 5, English |
12306 | blog.varonis.com | 14365 | 4.99 | 200 | HTML 5, English |
12307 | list25.com | 14366 | 4.99 | 200 | HTML 5, English |
12308 | tcf.org | 14367 | 4.99 | 200 | HTML 5, English |
12309 | e-codices.unifr.ch | 14368 | 4.99 | 200 | HTML 5, English |
12310 | picturethisai.com | 14369 | 4.99 | 200 | HTML 5, English |
12311 | mavenclinic.com | 14370 | 4.99 | 200 | HTML 5, English |
12312 | less.works | 14371 | 4.99 | 200 | HTML 5, English |
12313 | roanoke.com | 14372 | 4.99 | 200 | HTML 5, English |
12314 | experiments.withgoogle.com | 14373 | 4.99 | 200 | HTML 5, No Lang |
12315 | ssab.com | 14374 | 4.99 | 200 | HTML 5, English |
12316 | zuerich.com | 14375 | 4.99 | 200 | HTML 5, English |
12317 | slideserve.com | 14376 | 4.99 | 200 | HTML 5, English |
12318 | ifdesign.com | 14377 | 4.99 | 200 | HTML 5, English |
12319 | instapage.com | 14380 | 4.99 | 200 | HTML 5, English |
12320 | tampermonkey.net | 14381 | 4.99 | 200 | HTML 5, English |
12321 | t.cn | 14382 | 4.99 | 200 | HTML 5, No Lang |
12322 | intelligence.org | 14383 | 4.99 | 200 | HTML 5, English |
12323 | csa-iot.org | 14384 | 4.99 | 200 | HTML 5, English |
12324 | spiekermann.com | 14385 | 4.99 | 200 | HTML 5, English |
12325 | klim.co.nz | 14386 | 4.99 | 200 | HTML 5, English |
12326 | depositonce.tu-berlin.de | 14387 | 4.99 | 200 | HTML 5, English |
12327 | siepr.stanford.edu | 14389 | 4.99 | 200 | HTML 5, English |
12328 | the-sun.com | 14390 | 4.99 | 200 | HTML 5, English |
12329 | selenic.com | 14391 | 4.99 | 200 | No Lang |
12330 | opencontent.org | 14393 | 4.99 | 200 | HTML 5, English |
12331 | picpay.com | 14394 | 4.99 | 200 | HTML 5 |
12332 | sam.gov | 14396 | 4.99 | 200 | HTML 5, English |
12333 | raf.mod.uk | 14397 | 4.99 | 200 | HTML 5, English |
12334 | judiciary.senate.gov | 14398 | 4.99 | 200 | HTML 5, English |
12335 | curiositystream.com | 14400 | 4.99 | 200 | HTML 5, English |
12336 | novartis.com | 14401 | 4.99 | 200 | HTML 5, English |
12337 | blog.ethereum.org | 14402 | 4.99 | 200 | HTML 5, English |
12338 | sched.co | 14403 | 4.99 | 200 | HTML 5, English |
12339 | shopzilla.com | 14404 | 4.99 | 200 | HTML 5, English |
12340 | unidata.ucar.edu | 14405 | 4.99 | 200 | HTML 5, No Lang |
12341 | tripsavvy.com | 14406 | 4.99 | 200 | HTML 5, English |
12342 | nknews.org | 14407 | 4.99 | 200 | HTML 5, English |
12343 | courses.washington.edu | 14408 | 4.99 | 200 | No Lang |
12344 | lutron.com | 14410 | 4.99 | 200 | HTML 5, English |
12345 | dnb.de | 14412 | 4.99 | 200 | HTML 5 |
12346 | mindfiresolutions.com | 14413 | 4.99 | 200 | HTML 5, English |
12347 | nautil.us | 14414 | 4.99 | 200 | HTML 5, English |
12348 | abebooks.co.uk | 14415 | 4.99 | 200 | HTML 5, English |
12349 | climaterealityproject.org | 14416 | 4.99 | 200 | English |
12350 | b.link | 14417 | 4.99 | 200 | HTML 5, No Lang |
12351 | aryel.io | 14418 | 4.99 | 200 | HTML 5, English |
12352 | pmg.csail.mit.edu | 14419 | 4.99 | 200 | No Lang |
12353 | statescoop.com | 14420 | 4.99 | 200 | HTML 5, English |
12354 | afdc.energy.gov | 14421 | 4.99 | 200 | HTML 5, English |
12355 | cca.qc.ca | 14422 | 4.99 | 200 | HTML 5, English |
12356 | wpadvancedads.com | 14423 | 4.99 | 200 | HTML 5, English |
12357 | help.aol.com | 14424 | 4.99 | 200 | HTML 5, English |
12358 | usa.chinadaily.com.cn | 14425 | 4.99 | 200 | No Lang, Transitional |
12359 | ihi.org | 14426 | 4.99 | 200 | HTML 5, English |
12360 | timesmachine.nytimes.com | 14428 | 4.99 | 200 | HTML 5, English |
12361 | hormel.com | 14430 | 4.99 | 200 | HTML 5, English |
12362 | ustravel.org | 14431 | 4.99 | 200 | HTML 5, English |
12363 | relaischateaux.com | 14432 | 4.99 | 200 | HTML 5 |
12364 | privacypolicytemplate.net | 14433 | 4.99 | 200 | HTML 5, English |
12365 | ur.se | 14435 | 4.99 | 200 | HTML 5 |
12366 | citilink.ru | 14436 | 4.99 | 200 | HTML 5 |
12367 | global.epson.com | 14438 | 4.99 | 200 | English, Transitional |
12368 | ucsd.edu | 14439 | 4.99 | 200 | HTML 5, English |
12369 | metrotransit.org | 14440 | 4.99 | 200 | HTML 5, English |
12370 | worldpackers.com | 14441 | 4.99 | 200 | HTML 5, English |
12371 | forecast.weather.gov | 14444 | 4.99 | 200 | No Lang, Transitional |
12372 | radiantmediaplayer.com | 14445 | 4.99 | 200 | HTML 5, English |
12373 | linkwhisper.com | 14446 | 4.99 | 200 | HTML 5, English |
12374 | landr.com | 14447 | 4.99 | 200 | HTML 5, English |
12375 | healio.com | 14448 | 4.99 | 200 | HTML 5, English |
12376 | orbitmedia.com | 14450 | 4.99 | 200 | HTML 5, English |
12377 | basketball-reference.com | 14451 | 4.99 | 200 | HTML 5, English |
12378 | brokeassstuart.com | 14452 | 4.99 | 200 | HTML 5, English |
12379 | visitdubai.com | 14455 | 4.99 | 200 | HTML 5, English |
12380 | celebratingsweets.com | 14456 | 4.99 | 200 | HTML 5, English |
12381 | phillyvoice.com | 14457 | 4.99 | 200 | HTML 5, No Lang |
12382 | farm.bot | 14458 | 4.99 | 200 | HTML 5, English |
12383 | eos.com | 14459 | 4.99 | 200 | HTML 5, English |
12384 | nls.uk | 14460 | 4.99 | 200 | HTML 5, English |
12385 | people.epfl.ch | 14461 | 4.99 | 200 | HTML 5, No Lang |
12386 | m.imgur.com | 14462 | 4.99 | 200 | HTML 5, English |
12387 | cs.auckland.ac.nz | 14463 | 4.99 | 200 | HTML 5, English |
12388 | fbo.gov | 14464 | 4.99 | 200 | HTML 5, English |
12389 | portal.bsnl.in | 14465 | 4.99 | 200 | HTML 5, English |
12390 | channel5.com | 14466 | 4.99 | 200 | HTML 5, English |
12391 | nagpurtoday.in | 14467 | 4.99 | 200 | HTML 5, English |
12392 | bristolpost.co.uk | 14468 | 4.99 | 200 | HTML 5, English |
12393 | jlab.org | 14469 | 4.99 | 200 | HTML 5, English |
12394 | brunel.ac.uk | 14470 | 4.99 | 200 | HTML 5, English |
12395 | irp-cdn.multiscreensite.com | 14471 | 4.99 | 200 | No Lang |
12396 | tithe.ly | 14472 | 4.99 | 200 | HTML 5, English |
12397 | oig.hhs.gov | 14473 | 4.99 | 200 | HTML 5, No Lang |
12398 | adinserter.pro | 14474 | 4.99 | 200 | HTML 5, English |
12399 | microbiologyresearch.org | 14476 | 4.99 | 200 | HTML 5, No Lang |
12400 | jaspreetchahal.org | 14478 | 4.99 | 200 | English, Transitional |
Data from: Open PageRank