Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18301 | formpl.us | 21331 | 4.87 | 200 | HTML 5, English |
18302 | de.leica-camera.com | 21332 | 4.87 | 200 | HTML 5 |
18303 | vedantu.com | 21333 | 4.87 | 200 | HTML 5, English |
18304 | akronchildrens.org | 21334 | 4.87 | 200 | HTML 5, English |
18305 | thesalmons.org | 21335 | 4.87 | 200 | No Lang |
18306 | greencarreports.com | 21336 | 4.87 | 200 | HTML 5, English |
18307 | scrutinizer-ci.com | 21337 | 4.87 | 200 | English, Strict |
18308 | britishlibrary.typepad.co.uk | 21338 | 4.87 | 200 | HTML 5, No Lang |
18309 | tray.io | 21339 | 4.87 | 200 | HTML 5, English |
18310 | lib.umn.edu | 21340 | 4.87 | 200 | HTML 5, English |
18311 | learner.org | 21341 | 4.87 | 200 | HTML 5, English |
18312 | frandroid.com | 21342 | 4.87 | 200 | HTML 5 |
18313 | mypacer.com | 21343 | 4.87 | 200 | HTML 5, English |
18314 | chopra.com | 21344 | 4.87 | 200 | HTML 5, English |
18315 | collection.cooperhewitt.org | 21345 | 4.87 | 200 | HTML 5, No Lang |
18316 | openfontlibrary.org | 21346 | 4.87 | 200 | HTML 5, English |
18317 | chelseagreen.com | 21347 | 4.87 | 200 | HTML 5, English |
18318 | dor.mo.gov | 21348 | 4.87 | 200 | HTML 5, English |
18319 | bozell.com | 21349 | 4.87 | 200 | HTML 5, English |
18320 | public.nrao.edu | 21350 | 4.87 | 200 | HTML 5, English |
18321 | baseballhall.org | 21351 | 4.87 | 200 | HTML 5, English |
18322 | jdwetherspoon.com | 21352 | 4.87 | 200 | HTML 5, English |
18323 | miamioh.edu | 21353 | 4.87 | 200 | HTML 5, English |
18324 | canon.com.au | 21355 | 4.87 | 200 | HTML 5, No Lang |
18325 | museoreinasofia.es | 21356 | 4.87 | 200 | HTML 5 |
18326 | typesense.org | 21358 | 4.87 | 200 | HTML 5, English |
18327 | aidenlab.org | 21361 | 4.87 | 200 | HTML 5, No Lang |
18328 | globest.com | 21362 | 4.87 | 200 | HTML 5, English |
18329 | ftsafe.com | 21363 | 4.87 | 200 | HTML 5, English |
18330 | oneall.com | 21364 | 4.87 | 200 | HTML 5, English |
18331 | sangoma.com | 21365 | 4.87 | 200 | HTML 5, English |
18332 | jonassebastianohlsson.com | 21366 | 4.87 | 200 | HTML 5, No Lang |
18333 | stevespanglerscience.com | 21368 | 4.87 | 200 | English |
18334 | plugins.qgis.org | 21369 | 4.87 | 200 | No Lang |
18335 | belspo.be | 21370 | 4.87 | 200 | HTML 5, English |
18336 | objkt.com | 21371 | 4.87 | 200 | HTML 5, English |
18337 | datarobot.com | 21373 | 4.87 | 200 | HTML 5, English |
18338 | docutils.sourceforge.net | 21374 | 4.87 | 200 | HTML 5, English |
18339 | yeastgenome.org | 21376 | 4.87 | 200 | HTML 5, No Lang |
18340 | amadeus.com | 21377 | 4.87 | 200 | HTML 5, English |
18341 | nationwide.com | 21378 | 4.87 | 200 | English |
18342 | inbloombakery.com | 21379 | 4.87 | 200 | HTML 5, English |
18343 | poptin.com | 21380 | 4.87 | 200 | HTML 5, English |
18344 | lp.constantcontactpages.com | 21381 | 4.87 | 200 | HTML 5, No Lang |
18345 | docs.klarna.com | 21382 | 4.87 | 200 | HTML 5, English |
18346 | mpia.de | 21383 | 4.87 | 200 | HTML 5 |
18347 | nhsggc.org.uk | 21384 | 4.87 | 200 | HTML 5, English |
18348 | aveda.com | 21386 | 4.87 | 200 | HTML 5, English |
18349 | osha.europa.eu | 21388 | 4.87 | 200 | HTML 5, English |
18350 | thedodo.com | 21389 | 4.87 | 200 | HTML 5, English |
18351 | issuelab.org | 21390 | 4.87 | 200 | HTML 5, English |
18352 | heart.bmj.com | 21391 | 4.87 | 200 | HTML 5, English |
18353 | animationmagazine.net | 21392 | 4.87 | 200 | English |
18354 | inrupt.com | 21393 | 4.87 | 200 | HTML 5, English |
18355 | kce.fgov.be | 21394 | 4.87 | 200 | HTML 5, English |
18356 | mediadecoder.blogs.nytimes.com | 21395 | 4.87 | 200 | HTML 5, English |
18357 | polska-zbrojna.pl | 21396 | 4.87 | 200 | HTML 5, No Lang |
18358 | adbusters.org | 21397 | 4.87 | 200 | HTML 5, No Lang |
18359 | lonny.com | 21398 | 4.87 | 200 | HTML 5, English |
18360 | windfinder.com | 21399 | 4.87 | 200 | HTML 5, English |
18361 | bugherd.com | 21400 | 4.87 | 200 | HTML 5, English |
18362 | blackfire.io | 21401 | 4.87 | 200 | HTML 5, English |
18363 | esafety.gov.au | 21402 | 4.87 | 200 | HTML 5, English |
18364 | eatingbirdfood.com | 21403 | 4.87 | 200 | HTML 5, English |
18365 | stickertalk.com | 21404 | 4.87 | 200 | HTML 5, English |
18366 | tr.pinterest.com | 21405 | 4.87 | 200 | HTML 5, English |
18367 | institutionalinvestor.com | 21407 | 4.87 | 200 | HTML 5, English |
18368 | spectrum.library.concordia.ca | 21409 | 4.87 | 200 | English, Transitional |
18369 | joom.ag | 21410 | 4.87 | 200 | HTML 5, English |
18370 | scielo.org | 21411 | 4.87 | 200 | No Lang |
18371 | cafeastrology.com | 21413 | 4.87 | 200 | HTML 5, English |
18372 | mms.tveyes.com | 21414 | 4.87 | 200 | No Lang |
18373 | sovon.nl | 21415 | 4.87 | 200 | HTML 5 |
18374 | wpjobmanager.com | 21416 | 4.87 | 200 | HTML 5, English |
18375 | zabars.com | 21417 | 4.87 | 200 | No Lang |
18376 | p-world.co.jp | 21419 | 4.87 | 200 | HTML 5, No Lang |
18377 | library.stanford.edu | 21420 | 4.87 | 200 | HTML 5, English |
18378 | marketplacepulse.com | 21422 | 4.87 | 200 | HTML 5, English |
18379 | www12.senado.leg.br | 21423 | 4.87 | 200 | HTML 5 |
18380 | help.eclipse.org | 21424 | 4.87 | 200 | No Lang |
18381 | logo.com | 21425 | 4.87 | 200 | HTML 5, English |
18382 | msue.anr.msu.edu | 21426 | 4.87 | 200 | HTML 5, English |
18383 | data.bnf.fr | 21427 | 4.87 | 200 | HTML 5, English |
18384 | bernmobil.ch | 21428 | 4.87 | 200 | HTML 5, English |
18385 | brandbrilliance.co.za | 21429 | 4.87 | 200 | HTML 5 |
18386 | lsbu.ac.uk | 21430 | 4.87 | 200 | HTML 5, English |
18387 | cs.pitt.edu | 21431 | 4.87 | 200 | HTML 5, English |
18388 | theartofeducation.edu | 21432 | 4.87 | 200 | HTML 5, English |
18389 | apmreports.org | 21433 | 4.87 | 200 | HTML 5, English |
18390 | emclient.com | 21435 | 4.87 | 200 | HTML 5, English |
18391 | gov.gg | 21436 | 4.87 | 200 | HTML 5, No Lang |
18392 | puc-rio.br | 21437 | 4.87 | 200 | HTML 5 |
18393 | daisydiskapp.com | 21438 | 4.87 | 200 | HTML 5, English |
18394 | smrt.com.sg | 21439 | 4.87 | 200 | HTML 5, No Lang |
18395 | hyundai.ru | 21440 | 4.87 | 200 | HTML 5 |
18396 | offi.fr | 21441 | 4.87 | 200 | HTML 5 |
18397 | imagineer.co.jp | 21442 | 4.87 | 200 | HTML 5 |
18398 | haw-hamburg.de | 21443 | 4.87 | 200 | HTML 5 |
18399 | tnuck.com | 21444 | 4.87 | 200 | HTML 5, English |
18400 | adventist.org | 21445 | 4.87 | 200 | HTML 5, English |
Data from: Open PageRank