Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
17301 | lords.org | 20172 | 4.88 | 200 | HTML 5, No Lang |
17302 | albanian.cri.cn | 20173 | 4.88 | 200 | HTML 5, English |
17303 | ti.arc.nasa.gov | 20174 | 4.88 | 200 | HTML 5, English |
17304 | doomworld.com | 20175 | 4.88 | 200 | HTML 5, English |
17305 | ricette.giallozafferano.it | 20176 | 4.88 | 200 | HTML 5 |
17306 | 90daykorean.com | 20177 | 4.88 | 200 | HTML 5, English |
17307 | p2pu.org | 20178 | 4.88 | 200 | HTML 5, English |
17308 | humanorigins.si.edu | 20179 | 4.88 | 200 | HTML 5, English |
17309 | veronalabs.com | 20180 | 4.88 | 200 | English |
17310 | books.google.com.sg | 20181 | 4.88 | 200 | HTML 5, No Lang |
17311 | bensound.com | 20182 | 4.88 | 200 | HTML 5, English |
17312 | ww2.kqed.org | 20183 | 4.88 | 200 | HTML 5, English |
17313 | www1.eere.energy.gov | 20184 | 4.88 | 200 | No Lang |
17314 | monashfodmap.com | 20185 | 4.88 | 200 | HTML 5, No Lang |
17315 | travel.navitime.com | 20186 | 4.88 | 200 | HTML 5, English |
17316 | math.bas.bg | 20187 | 4.88 | 200 | HTML 5 |
17317 | portugalresident.com | 20188 | 4.88 | 200 | HTML 5, English |
17318 | cattolica.it | 20189 | 4.88 | 200 | HTML 5 |
17319 | dbaron.org | 20190 | 4.88 | 200 | HTML 5, English |
17320 | the-aiff.com | 20191 | 4.88 | 200 | HTML 5, No Lang |
17321 | karlsruhe.de | 20193 | 4.88 | 200 | HTML 5 |
17322 | ipython.org | 20195 | 4.88 | 200 | HTML 5, English |
17323 | developers.taxjar.com | 20196 | 4.88 | 200 | HTML 5, No Lang |
17324 | paycor.com | 20197 | 4.88 | 200 | HTML 5, English |
17325 | bluefish.openoffice.nl | 20198 | 4.88 | 200 | English, Strict |
17326 | baremetrics.com | 20199 | 4.88 | 200 | HTML 5, English |
17327 | kvue.com | 20200 | 4.88 | 200 | HTML 5, English |
17328 | bluethumb.com.au | 20201 | 4.88 | 200 | HTML 5, English |
17329 | land.allears.net | 20202 | 4.88 | 200 | HTML 5, English |
17330 | office.xerox.com | 20203 | 4.88 | 200 | HTML 5, English |
17331 | vectorstock.com | 20204 | 4.88 | 200 | HTML 5, English |
17332 | wearenotmartha.com | 20205 | 4.88 | 200 | HTML 5, English |
17333 | paradoxinteractive.com | 20207 | 4.88 | 200 | HTML 5, English |
17334 | ngpvan.com | 20209 | 4.88 | 200 | HTML 5, English |
17335 | goo.ne.jp | 20211 | 4.88 | 200 | HTML 5 |
17336 | news.syr.edu | 20212 | 4.88 | 200 | HTML 5, English |
17337 | latitudes.org | 20213 | 4.88 | 200 | HTML 5, English |
17338 | bitmex.com | 20216 | 4.88 | 200 | HTML 5, No Lang |
17339 | noscript.net | 20217 | 4.88 | 200 | HTML 5, English |
17340 | amsmeteors.org | 20218 | 4.88 | 200 | HTML 5, English |
17341 | barilliance.com | 20219 | 4.88 | 200 | HTML 5, English |
17342 | greatamericaneclipse.com | 20220 | 4.88 | 200 | HTML 5, English |
17343 | qurium.org | 20221 | 4.88 | 200 | HTML 5, English |
17344 | aaroads.com | 20222 | 4.88 | 200 | HTML 5, English |
17345 | ecorner.stanford.edu | 20224 | 4.88 | 200 | HTML 5, English |
17346 | iwillteachyoutoberich.com | 20225 | 4.88 | 200 | HTML 5, English |
17347 | wivb.com | 20227 | 4.88 | 200 | HTML 5, English |
17348 | hrcak.srce.hr | 20228 | 4.88 | 200 | HTML 5 |
17349 | gi-de.com | 20229 | 4.88 | 200 | HTML 5, English |
17350 | wetv.vip | 20230 | 4.88 | 200 | HTML 5, English |
17351 | spritmonitor.de | 20231 | 4.88 | 200 | No Lang |
17352 | therobotreport.com | 20232 | 4.88 | 200 | HTML 5, English |
17353 | wiki.dbpedia.org | 20233 | 4.88 | 200 | HTML 5, English |
17354 | papertrailapp.com | 20234 | 4.88 | 200 | HTML 5, English |
17355 | wordpress.slack.com | 20235 | 4.88 | 200 | HTML 5, English |
17356 | mzv.cz | 20236 | 4.88 | 200 | English, Transitional |
17357 | banano.cc | 20238 | 4.88 | 200 | HTML 5, English |
17358 | inventhelp.com | 20239 | 4.88 | 200 | HTML 5, English |
17359 | david.shanske.com | 20241 | 4.88 | 200 | HTML 5, English |
17360 | oxfordre.com | 20242 | 4.88 | 200 | HTML 5, English |
17361 | americanancestors.org | 20243 | 4.88 | 200 | HTML 5, English |
17362 | fox5dc.com | 20244 | 4.88 | 200 | HTML 5, English |
17363 | bundeswahlleiter.de | 20246 | 4.88 | 200 | HTML 5 |
17364 | sfedu.ru | 20247 | 4.88 | 200 | HTML 5 |
17365 | tandem.chat | 20248 | 4.88 | 200 | HTML 5, English |
17366 | cakewallet.com | 20249 | 4.88 | 200 | HTML 5, English |
17367 | healthpartners.com | 20250 | 4.88 | 200 | HTML 5, English |
17368 | smartbrief.com | 20251 | 4.88 | 200 | HTML 5, English |
17369 | skaut.cz | 20252 | 4.88 | 200 | HTML 5 |
17370 | tidymom.net | 20254 | 4.88 | 200 | HTML 5, English |
17371 | users.ece.cmu.edu | 20255 | 4.88 | 200 | HTML 5, English |
17372 | no-margin-for-errors.com | 20256 | 4.88 | 200 | HTML 5, English |
17373 | fide.com | 20257 | 4.88 | 200 | HTML 5, English |
17374 | opensourceecology.org | 20258 | 4.88 | 200 | HTML 5, English |
17375 | albacross.com | 20259 | 4.88 | 200 | HTML 5, English |
17376 | capital.sp.gov.br | 20260 | 4.88 | 200 | HTML 5 |
17377 | bmcmedresmethodol.biomedcentral.com | 20261 | 4.88 | 200 | HTML 5, English |
17378 | thepeninsulaqatar.com | 20262 | 4.88 | 200 | HTML 5, English |
17379 | metanoia.org | 20263 | 4.88 | 200 | No Lang |
17380 | eurosoftlab.com | 20264 | 4.88 | 200 | HTML 5, English |
17381 | phoenixheart.net | 20265 | 4.88 | 200 | English, Transitional |
17382 | mindat.org | 20266 | 4.88 | 200 | HTML 5, No Lang |
17383 | nasponline.org | 20267 | 4.88 | 200 | HTML 5, English |
17384 | credit-agricole.com | 20268 | 4.88 | 200 | HTML 5 |
17385 | tepapa.govt.nz | 20270 | 4.88 | 200 | HTML 5, No Lang |
17386 | mediagoblin.org | 20271 | 4.88 | 200 | HTML 5, No Lang |
17387 | lw.com | 20272 | 4.88 | 200 | HTML 5, English |
17388 | frase.io | 20273 | 4.88 | 200 | HTML 5, English |
17389 | data.giss.nasa.gov | 20274 | 4.88 | 200 | HTML 5, English |
17390 | idolish7.com | 20275 | 4.88 | 200 | HTML 5 |
17391 | quillette.com | 20276 | 4.88 | 200 | HTML 5, English |
17392 | walletofsatoshi.com | 20277 | 4.88 | 200 | HTML 5, English |
17393 | sfsu.edu | 20278 | 4.88 | 200 | HTML 5, English |
17394 | marchforourlives.com | 20280 | 4.88 | 200 | HTML 5, English |
17395 | pegi.info | 20281 | 4.88 | 200 | HTML 5, English |
17396 | dcceew.gov.au | 20282 | 4.88 | 200 | HTML 5, English |
17397 | about.americanexpress.com | 20283 | 4.88 | 200 | No Lang |
17398 | etherpad.wikimedia.org | 20284 | 4.88 | 200 | HTML 5, No Lang |
17399 | df.cl | 20285 | 4.88 | 200 | HTML 5 |
17400 | boohooman.com | 20287 | 4.88 | 200 | HTML 5, English |
Data from: Open PageRank