Table populated from a list of the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but ranked below 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, valid HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are listed naked, without a fully qualified hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com, where one should redirect to the other.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks clients will never get a redirect
- Domains that redirect behind an invalid SSL cert
- This is super easy to fix, or to park the domain to handle redirects.
- Lost revenue, given how much inbound linking it takes to get on this list
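The doctype and lang observations above can be checked with a small parser. What follows is a minimal Python sketch, not the crawler actually used for this page: the function name `classify` and the flag rules are assumptions inferred from the Flags column of the table (for example, that a row with neither "English" nor "No Lang" has a non-English lang value).

```python
from html.parser import HTMLParser


class _HeadScanner(HTMLParser):
    """Collects the first doctype declaration and the <html lang> value."""

    def __init__(self):
        super().__init__()
        self.doctype = None
        self.lang = None
        self.seen_html = False

    def handle_decl(self, decl):
        # Called for <!DOCTYPE ...>; decl is the text between "<!" and ">".
        if self.doctype is None and decl.lower().startswith("doctype"):
            self.doctype = " ".join(decl.split())

    def handle_starttag(self, tag, attrs):
        # Only the first <html> tag is considered.
        if tag == "html" and not self.seen_html:
            self.seen_html = True
            self.lang = dict(attrs).get("lang")


def classify(html_text):
    """Return flags in the style of the table (assumed rules, see lead-in)."""
    scanner = _HeadScanner()
    scanner.feed(html_text)
    scanner.close()

    flags = []
    doctype = (scanner.doctype or "").lower()
    if doctype == "doctype html":
        flags.append("HTML 5")

    lang = (scanner.lang or "").strip().lower()
    if not lang:
        flags.append("No Lang")
    elif lang.split("-")[0] == "en":
        flags.append("English")
    # Assumption: any other lang value produces no language flag.

    if "transitional" in doctype:
        flags.append("Transitional")
    elif "strict" in doctype:
        flags.append("Strict")
    return flags
```

For example, `classify('<!DOCTYPE html><html lang="en-US">')` yields `["HTML 5", "English"]`. Fetching the page is left out on purpose, since the SSL and redirect problems noted above mean the fetch step needs its own error handling.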
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
5301 | danpink.com | 6250 | 5.27 | 200 | HTML 5, No Lang |
5302 | webassembly.org | 6251 | 5.27 | 200 | HTML 5, English |
5303 | purevpn.com | 6252 | 5.27 | 200 | HTML 5, English |
5304 | dblp.org | 6254 | 5.27 | 200 | HTML 5, English |
5305 | marshall.edu | 6255 | 5.27 | 200 | HTML 5, English |
5306 | zola.com | 6256 | 5.27 | 200 | HTML 5, English |
5307 | simpleshop.cz | 6257 | 5.27 | 200 | HTML 5 |
5308 | nationwide.co.uk | 6258 | 5.27 | 200 | HTML 5, English |
5309 | glosbe.com | 6259 | 5.27 | 200 | HTML 5, English |
5310 | siliconrepublic.com | 6260 | 5.27 | 200 | HTML 5, English |
5311 | stadt-zuerich.ch | 6261 | 5.27 | 200 | HTML 5 |
5312 | drdobbs.com | 6262 | 5.27 | 200 | HTML 5, No Lang |
5313 | philly.com | 6263 | 5.27 | 200 | HTML 5, English |
5314 | ap.org | 6264 | 5.27 | 200 | HTML 5, English |
5315 | piwik.org | 6265 | 5.27 | 200 | HTML 5, English |
5316 | menafn.com | 6266 | 5.27 | 200 | HTML 5, No Lang |
5317 | ethnologue.com | 6267 | 5.27 | 200 | HTML 5, English |
5318 | onesignal.com | 6268 | 5.27 | 200 | HTML 5, English |
5319 | mapsengine.google.com | 6269 | 5.27 | 200 | HTML 5, English |
5320 | covid.joinzoe.com | 6270 | 5.27 | 200 | HTML 5, English |
5321 | securiti.ai | 6272 | 5.27 | 200 | HTML 5, English |
5322 | gazetadopovo.com.br | 6273 | 5.27 | 200 | HTML 5 |
5323 | biodiversitylibrary.org | 6274 | 5.27 | 200 | HTML 5, English |
5324 | reason.com | 6275 | 5.27 | 200 | HTML 5, English |
5325 | doximity.com | 6276 | 5.27 | 200 | HTML 5, English |
5326 | catchthemes.com | 6277 | 5.27 | 200 | HTML 5, No Lang |
5327 | washingtoncitypaper.com | 6278 | 5.27 | 200 | HTML 5, English |
5328 | clickcease.com | 6279 | 5.27 | 200 | HTML 5, English |
5329 | thetrainline.com | 6280 | 5.27 | 200 | HTML 5, English |
5330 | tinyletter.com | 6281 | 5.27 | 200 | HTML 5, English |
5331 | cplusplus.com | 6282 | 5.27 | 200 | HTML 5, No Lang |
5332 | philamuseum.org | 6283 | 5.27 | 200 | HTML 5, English |
5333 | tiqets.com | 6286 | 5.27 | 200 | HTML 5, English |
5334 | ninds.nih.gov | 6287 | 5.27 | 200 | HTML 5, English |
5335 | stanforddaily.com | 6288 | 5.27 | 200 | HTML 5, English |
5336 | kde.org | 6289 | 5.27 | 200 | HTML 5, English |
5337 | grow.google | 6290 | 5.27 | 200 | HTML 5, English |
5338 | lta.org.uk | 6292 | 5.27 | 200 | English |
5339 | engineering.purdue.edu | 6293 | 5.27 | 200 | HTML 5, English |
5340 | farm5.staticflickr.com | 6294 | 5.27 | 200 | No Lang |
5341 | community.spiceworks.com | 6296 | 5.27 | 200 | HTML 5, English |
5342 | blockstream.com | 6297 | 5.27 | 200 | HTML 5, English |
5343 | hdmi.org | 6298 | 5.27 | 200 | HTML 5, English |
5344 | romper.com | 6299 | 5.27 | 200 | HTML 5, English |
5345 | dresden.de | 6300 | 5.27 | 200 | HTML 5 |
5346 | wyze.com | 6301 | 5.27 | 200 | HTML 5, English |
5347 | redditinc.com | 6302 | 5.27 | 200 | HTML 5, English |
5348 | justpaste.me | 6303 | 5.27 | 200 | HTML 5, English |
5349 | about.att.com | 6304 | 5.27 | 200 | HTML 5, No Lang |
5350 | marketwired.com | 6305 | 5.27 | 200 | HTML 5, English |
5351 | tmb.cat | 6306 | 5.26 | 200 | HTML 5 |
5352 | art.thewalters.org | 6307 | 5.26 | 200 | HTML 5, English |
5353 | theodysseyonline.com | 6308 | 5.26 | 200 | HTML 5, No Lang |
5354 | tipeeestream.com | 6309 | 5.26 | 200 | HTML 5, No Lang |
5355 | ntt.com | 6310 | 5.26 | 200 | HTML 5 |
5356 | awardspace.com | 6311 | 5.26 | 200 | HTML 5, English |
5357 | edocr.com | 6312 | 5.26 | 200 | HTML 5, No Lang |
5358 | energystar.gov | 6313 | 5.26 | 200 | HTML 5, English |
5359 | camh.ca | 6314 | 5.26 | 200 | HTML 5, English |
5360 | dougal.gunters.org | 6315 | 5.26 | 200 | HTML 5, English |
5361 | activitystrea.ms | 6316 | 5.26 | 200 | HTML 5, No Lang |
5362 | bloomsbury.com | 6317 | 5.26 | 200 | HTML 5, English |
5363 | ksdk.com | 6318 | 5.26 | 200 | HTML 5, English |
5364 | worthpoint.com | 6320 | 5.26 | 200 | HTML 5, English |
5365 | keepachangelog.com | 6321 | 5.26 | 200 | No Lang |
5366 | bgca.org | 6322 | 5.26 | 200 | HTML 5, English |
5367 | nextinpact.com | 6323 | 5.26 | 200 | HTML 5 |
5368 | languages.oup.com | 6324 | 5.26 | 200 | HTML 5, English |
5369 | nastygal.com | 6325 | 5.26 | 200 | HTML 5, English |
5370 | wiki.apache.org | 6326 | 5.26 | 200 | HTML 5, English |
5371 | independent.ie | 6327 | 5.26 | 200 | HTML 5, No Lang |
5372 | eiga.com | 6330 | 5.26 | 200 | HTML 5 |
5373 | thelocal.it | 6331 | 5.26 | 200 | HTML 5, English |
5374 | gitlab.gnome.org | 6332 | 5.26 | 200 | HTML 5, No Lang |
5375 | getvoip.com | 6333 | 5.26 | 200 | HTML 5, English |
5376 | handheldmuseum.com | 6334 | 5.26 | 200 | No Lang, Transitional |
5377 | metacpan.org | 6336 | 5.26 | 200 | HTML 5, English |
5378 | airthings.com | 6337 | 5.26 | 200 | HTML 5, English |
5379 | epam.com | 6338 | 5.26 | 200 | HTML 5, English |
5380 | maccosmetics.com | 6339 | 5.26 | 200 | HTML 5, English |
5381 | pagesix.com | 6340 | 5.26 | 200 | HTML 5, English |
5382 | blueletterbible.org | 6341 | 5.26 | 200 | HTML 5, English |
5383 | portfolium.com | 6342 | 5.26 | 200 | HTML 5, English |
5384 | google.si | 6343 | 5.26 | 200 | HTML 5, English |
5385 | whatsmydns.net | 6344 | 5.26 | 200 | HTML 5, English |
5386 | closetcooking.com | 6345 | 5.26 | 200 | HTML 5, English |
5387 | idratherbewriting.com | 6346 | 5.26 | 200 | HTML 5, No Lang |
5388 | saskatchewan.ca | 6347 | 5.26 | 200 | HTML 5, English |
5389 | privacyinternational.org | 6348 | 5.26 | 200 | HTML 5, English |
5390 | mathworld.wolfram.com | 6349 | 5.26 | 200 | HTML 5, English |
5391 | first.org | 6352 | 5.26 | 200 | HTML 5, English |
5392 | jsonline.com | 6353 | 5.26 | 200 | HTML 5, English |
5393 | marketplace.atlassian.com | 6354 | 5.26 | 200 | HTML 5, English |
5394 | scholar.google.es | 6355 | 5.26 | 200 | HTML 5, No Lang |
5395 | esv.org | 6356 | 5.26 | 200 | HTML 5, English |
5396 | domusweb.it | 6357 | 5.26 | 200 | Strict |
5397 | www-personal.umich.edu | 6358 | 5.26 | 200 | HTML 5, No Lang |
5398 | hechingerreport.org | 6359 | 5.26 | 200 | HTML 5, English |
5399 | process.st | 6360 | 5.26 | 200 | HTML 5, English |
5400 | lalalab.com | 6361 | 5.26 | 200 | HTML 5, English |
Data from: Open PageRank