Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19101 | laws.justice.gc.ca | 22266 | 4.86 | 200 | HTML 5, English |
19102 | isocpp.org | 22267 | 4.86 | 200 | HTML 5, English |
19103 | recordtv.r7.com | 22268 | 4.86 | 200 | HTML 5 |
19104 | focus.ti.com | 22270 | 4.86 | 200 | HTML 5, English |
19105 | evolutionnews.org | 22272 | 4.86 | 200 | HTML 5, English |
19106 | columbusunderground.com | 22273 | 4.86 | 200 | English |
19107 | governor.wa.gov | 22276 | 4.86 | 200 | HTML 5, English |
19108 | zinnedproject.org | 22278 | 4.86 | 200 | HTML 5, English |
19109 | west-wind.com | 22279 | 4.86 | 200 | HTML 5, No Lang |
19110 | abc.org | 22280 | 4.86 | 200 | English, Transitional |
19111 | leaningtech.com | 22284 | 4.86 | 200 | HTML 5, English |
19112 | uri.edu | 22285 | 4.86 | 200 | HTML 5, English |
19113 | amff.com | 22287 | 4.86 | 200 | HTML 5, English |
19114 | lawsofux.com | 22289 | 4.86 | 200 | HTML 5, English |
19115 | gwebpro.com | 22290 | 4.86 | 200 | HTML 5, English |
19116 | avcr.cz | 22292 | 4.86 | 200 | HTML 5 |
19117 | open.firstory.me | 22293 | 4.86 | 200 | HTML 5, No Lang |
19118 | mimuw.edu.pl | 22294 | 4.86 | 200 | HTML 5, No Lang |
19119 | breakdance.com | 22295 | 4.86 | 200 | HTML 5, English |
19120 | folhabv.com.br | 22296 | 4.86 | 200 | HTML 5 |
19121 | mrclay.org | 22297 | 4.86 | 200 | HTML 5, English |
19122 | jaworowi.cz | 22298 | 4.86 | 200 | HTML 5 |
19123 | kickbox.com | 22299 | 4.86 | 200 | HTML 5, English |
19124 | tynker.com | 22300 | 4.86 | 200 | HTML 5, English |
19125 | devops-research.com | 22302 | 4.86 | 200 | HTML 5, English |
19126 | r4ds.had.co.nz | 22303 | 4.86 | 200 | HTML 5, English |
19127 | versiti.org | 22304 | 4.86 | 200 | HTML 5, English |
19128 | mikelittle.org | 22305 | 4.86 | 200 | English, Transitional |
19129 | customerthink.com | 22306 | 4.86 | 200 | English |
19130 | jaas.8x8.vc | 22307 | 4.86 | 200 | HTML 5, No Lang |
19131 | irtf.org | 22308 | 4.86 | 200 | HTML 5, English |
19132 | stingray.com | 22309 | 4.86 | 200 | HTML 5, English |
19133 | startup.google.com | 22310 | 4.86 | 200 | HTML 5, English |
19134 | comune.pordenone.it | 22311 | 4.86 | 200 | HTML 5 |
19135 | fair.org | 22312 | 4.86 | 200 | HTML 5, English |
19136 | universalpictures.com | 22313 | 4.86 | 200 | HTML 5, English |
19137 | mef.net | 22314 | 4.86 | 200 | HTML 5, English |
19138 | globallogic.com | 22315 | 4.86 | 200 | HTML 5, English |
19139 | rsc.org.uk | 22316 | 4.86 | 200 | HTML 5, No Lang |
19140 | gjensidige.no | 22317 | 4.86 | 200 | HTML 5 |
19141 | writing.com | 22318 | 4.86 | 200 | HTML 5, No Lang |
19142 | citi.com | 22321 | 4.86 | 200 | HTML 5, English |
19143 | applieddigitalskills.withgoogle.com | 22322 | 4.86 | 200 | HTML 5, English |
19144 | dfs.ny.gov | 22324 | 4.86 | 200 | HTML 5, English |
19145 | git.jami.net | 22325 | 4.86 | 200 | HTML 5, English |
19146 | neuroscience.cam.ac.uk | 22326 | 4.86 | 200 | HTML 5, English |
19147 | de.scribd.com | 22327 | 4.86 | 200 | HTML 5, English |
19148 | navicat.com | 22328 | 4.86 | 200 | HTML 5, English |
19149 | onesearch.com | 22329 | 4.86 | 200 | HTML 5, English |
19150 | nippon1.jp | 22330 | 4.86 | 200 | HTML 5 |
19151 | nesslabs.com | 22331 | 4.86 | 200 | HTML 5, English |
19152 | blogs.bmj.com | 22332 | 4.86 | 200 | English, Transitional |
19153 | publish.twitter.com | 22333 | 4.86 | 200 | HTML 5, English |
19154 | ytechb.com | 22334 | 4.86 | 200 | HTML 5, English |
19155 | indiarailinfo.com | 22335 | 4.86 | 200 | HTML 5, No Lang |
19156 | svenskalag.se | 22336 | 4.86 | 200 | HTML 5, English |
19157 | iga.in.gov | 22337 | 4.86 | 200 | HTML 5, English |
19158 | infobip.com | 22338 | 4.86 | 200 | HTML 5, English |
19159 | procreate.com | 22339 | 4.86 | 200 | HTML 5, English |
19160 | gi.alaska.edu | 22340 | 4.86 | 200 | HTML 5, English |
19161 | audiobooks.com | 22341 | 4.86 | 200 | HTML 5, English |
19162 | climatecommunication.yale.edu | 22342 | 4.86 | 200 | HTML 5, English |
19163 | mez.ink | 22343 | 4.86 | 200 | HTML 5, No Lang |
19164 | elcolombiano.com | 22344 | 4.86 | 200 | HTML 5 |
19165 | emaratalyoum.com | 22345 | 4.86 | 200 | HTML 5 |
19166 | matroska.org | 22346 | 4.86 | 200 | HTML 5, No Lang |
19167 | mainehistory.org | 22349 | 4.86 | 200 | HTML 5, English |
19168 | alfred.camera | 22350 | 4.86 | 200 | HTML 5, English |
19169 | internews.org | 22351 | 4.86 | 200 | HTML 5, English |
19170 | stephenwolfram.com | 22352 | 4.86 | 200 | HTML 5, English |
19171 | insidebigdata.com | 22353 | 4.86 | 200 | English, Transitional |
19172 | ccarnet.org | 22354 | 4.86 | 200 | HTML 5, English |
19173 | ipgbook.com | 22357 | 4.86 | 200 | No Lang, Transitional |
19174 | yacy.net | 22358 | 4.86 | 200 | HTML 5, English |
19175 | iodonna.it | 22360 | 4.86 | 200 | HTML 5 |
19176 | ctftime.org | 22361 | 4.86 | 200 | HTML 5, English |
19177 | homepages.math.uic.edu | 22362 | 4.86 | 200 | No Lang |
19178 | knot-dns.cz | 22364 | 4.86 | 200 | HTML 5, English |
19179 | nyckel.com | 22365 | 4.86 | 200 | HTML 5, English |
19180 | bibliosansfrontieres.org | 22366 | 4.86 | 200 | HTML 5 |
19181 | mashed.com | 22367 | 4.86 | 200 | HTML 5, English |
19182 | m.att.com | 22368 | 4.86 | 200 | HTML 5, English |
19183 | getalby.com | 22369 | 4.86 | 200 | HTML 5, English |
19184 | radiofg.com | 22370 | 4.86 | 200 | HTML 5 |
19185 | kovshenin.com | 22371 | 4.86 | 200 | HTML 5, English |
19186 | cocoapods.org | 22372 | 4.86 | 200 | HTML 5, English |
19187 | nilambar.net | 22373 | 4.86 | 200 | HTML 5, English |
19188 | swiggy.com | 22374 | 4.86 | 200 | HTML 5, English |
19189 | dvb.de | 22375 | 4.86 | 200 | HTML 5 |
19190 | archive.fosdem.org | 22378 | 4.86 | 200 | HTML 5, English |
19191 | allbookstores.com | 22380 | 4.86 | 200 | HTML 5, English |
19192 | boots.com | 22381 | 4.86 | 200 | HTML 5, English |
19193 | history.ac.uk | 22382 | 4.86 | 200 | HTML 5, English |
19194 | techengage.com | 22383 | 4.86 | 200 | English |
19195 | leisurejobs.com | 22384 | 4.86 | 200 | HTML 5, English |
19196 | sharjah.ae | 22385 | 4.86 | 200 | English |
19197 | transports.nouvelle-aquitaine.fr | 22386 | 4.86 | 200 | HTML 5 |
19198 | admonsters.com | 22389 | 4.86 | 200 | HTML 5, English |
19199 | phylodiversity.net | 22390 | 4.86 | 200 | |
19200 | partypoker.com | 22391 | 4.86 | 200 | HTML 5, English |
Data from: Open PageRank