Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
4301 | iconosquare.com | 5105 | 5.35 | 200 | HTML 5, English |
4302 | developer.spotify.com | 5106 | 5.35 | 200 | HTML 5, English |
4303 | pond5.com | 5107 | 5.35 | 200 | HTML 5, English |
4304 | insta360.com | 5108 | 5.35 | 200 | HTML 5, English |
4305 | voice.google.com | 5109 | 5.35 | 200 | English |
4306 | mobygames.com | 5110 | 5.35 | 200 | HTML 5, English |
4307 | sallysbakingaddiction.com | 5111 | 5.35 | 200 | HTML 5, English |
4308 | poststatus.com | 5112 | 5.35 | 200 | HTML 5, English |
4309 | st.com | 5113 | 5.35 | 200 | HTML 5, English |
4310 | frontlinedefenders.org | 5114 | 5.35 | 200 | HTML 5, English |
4311 | jamsadr.com | 5115 | 5.35 | 200 | HTML 5, English |
4312 | data.europa.eu | 5116 | 5.35 | 200 | HTML 5, English |
4313 | svs.gsfc.nasa.gov | 5117 | 5.35 | 200 | HTML 5, English |
4314 | typescriptlang.org | 5118 | 5.35 | 200 | HTML 5, English |
4315 | carscoops.com | 5119 | 5.35 | 200 | HTML 5, English |
4316 | oceanservice.noaa.gov | 5120 | 5.35 | 200 | HTML 5, English |
4317 | keepersecurity.com | 5122 | 5.35 | 200 | HTML 5, English |
4318 | mathworks.com | 5123 | 5.35 | 200 | HTML 5, English |
4319 | bc.edu | 5124 | 5.35 | 200 | HTML 5, English |
4320 | baltimoresun.com | 5125 | 5.35 | 200 | HTML 5, English |
4321 | hc-sc.gc.ca | 5126 | 5.35 | 200 | English, Strict |
4322 | jcrew.com | 5127 | 5.35 | 200 | HTML 5, English |
4323 | workers.cloudflare.com | 5129 | 5.35 | 200 | HTML 5, English |
4324 | umd.edu | 5130 | 5.35 | 200 | HTML 5, English |
4325 | nar.realtor | 5131 | 5.35 | 200 | HTML 5, English |
4326 | skfb.ly | 5132 | 5.35 | 200 | HTML 5, English |
4327 | majorgeeks.com | 5134 | 5.35 | 200 | HTML 5, English |
4328 | service-public.fr | 5135 | 5.35 | 200 | HTML 5 |
4329 | univision.com | 5136 | 5.35 | 200 | HTML 5 |
4330 | php.watch | 5137 | 5.35 | 200 | HTML 5, English |
4331 | jobs.google.com | 5138 | 5.35 | 200 | HTML 5, English |
4332 | businessinsider.de | 5139 | 5.35 | 200 | HTML 5 |
4333 | mbta.com | 5140 | 5.35 | 200 | HTML 5, English |
4334 | cityam.com | 5141 | 5.35 | 200 | HTML 5, English |
4335 | archinect.com | 5142 | 5.35 | 200 | English, Strict |
4336 | heritage.org | 5143 | 5.35 | 200 | HTML 5, English |
4337 | yogajournal.com | 5144 | 5.35 | 200 | HTML 5, English |
4338 | ansa.it | 5145 | 5.35 | 200 | HTML 5 |
4339 | acefitness.org | 5146 | 5.35 | 200 | HTML 5, English |
4340 | cedcommerce.com | 5147 | 5.35 | 200 | HTML 5, English |
4341 | sbir.gov | 5148 | 5.35 | 200 | HTML 5, English |
4342 | csun.edu | 5149 | 5.35 | 200 | HTML 5, English |
4343 | backstage.com | 5150 | 5.35 | 200 | HTML 5, English |
4344 | wellfound.com | 5151 | 5.35 | 200 | HTML 5, English |
4345 | ask.com | 5152 | 5.35 | 200 | HTML 5, English |
4346 | blog.sina.com.cn | 5153 | 5.35 | 200 | HTML 5, No Lang |
4347 | blog.ted.com | 5154 | 5.35 | 200 | HTML 5, English |
4348 | webopedia.com | 5155 | 5.35 | 200 | HTML 5, English |
4349 | specialolympics.org | 5156 | 5.35 | 200 | HTML 5, English |
4350 | bank.gov.ua | 5157 | 5.35 | 200 | HTML 5 |
4351 | surfshark.com | 5158 | 5.35 | 200 | HTML 5, English |
4352 | blinkist.com | 5159 | 5.35 | 200 | HTML 5, English |
4353 | fail2ban.org | 5161 | 5.35 | 200 | HTML 5, English |
4354 | v.youku.com | 5162 | 5.35 | 200 | No Lang |
4355 | newspapers.com | 5163 | 5.35 | 200 | HTML 5, English |
4356 | splitit.com | 5164 | 5.35 | 200 | HTML 5, English |
4357 | mpi-inf.mpg.de | 5165 | 5.35 | 200 | HTML 5, English |
4358 | education.ti.com | 5166 | 5.35 | 200 | HTML 5, English |
4359 | ctt.ec | 5167 | 5.35 | 200 | HTML 5, English |
4360 | ideone.com | 5169 | 5.35 | 200 | HTML 5, English |
4361 | projects.fivethirtyeight.com | 5170 | 5.35 | 200 | HTML 5, English |
4362 | ilga.gov | 5171 | 5.35 | 200 | English |
4363 | neopets.com | 5172 | 5.35 | 200 | HTML 5, English |
4364 | qrz.com | 5174 | 5.35 | 200 | HTML 5, English |
4365 | www3.epa.gov | 5175 | 5.35 | 200 | HTML 5, English |
4366 | api.drupal.org | 5176 | 5.35 | 200 | HTML 5, English |
4367 | bundesfinanzministerium.de | 5177 | 5.35 | 200 | HTML 5 |
4368 | wikis.ec.europa.eu | 5178 | 5.35 | 200 | HTML 5, English |
4369 | laughingsquid.com | 5180 | 5.35 | 200 | HTML 5, English |
4370 | madrid.es | 5181 | 5.35 | 200 | HTML 5 |
4371 | stern.nyu.edu | 5182 | 5.35 | 200 | HTML 5, English |
4372 | sdtimes.com | 5183 | 5.34 | 200 | HTML 5, English |
4373 | scielo.br | 5184 | 5.34 | 200 | HTML 5 |
4374 | security.stackexchange.com | 5185 | 5.34 | 200 | HTML 5, English |
4375 | climate.nasa.gov | 5186 | 5.34 | 200 | HTML 5, English |
4376 | data.gv.at | 5189 | 5.34 | 200 | HTML 5, English |
4377 | graphql.org | 5190 | 5.34 | 200 | HTML 5, English |
4378 | nanowrimo.org | 5191 | 5.34 | 200 | HTML 5, English |
4379 | turbotax.intuit.com | 5192 | 5.34 | 200 | HTML 5, English |
4380 | madmimi.com | 5193 | 5.34 | 200 | HTML 5, English |
4381 | futureoflife.org | 5195 | 5.34 | 200 | HTML 5, English |
4382 | karriere.at | 5197 | 5.34 | 200 | HTML 5 |
4383 | dolby.com | 5198 | 5.34 | 200 | HTML 5, English |
4384 | eeas.europa.eu | 5199 | 5.34 | 200 | HTML 5, No Lang |
4385 | canadiantire.ca | 5200 | 5.34 | 200 | HTML 5, English |
4386 | brighttalk.com | 5201 | 5.34 | 200 | HTML 5, English |
4387 | minne.com | 5202 | 5.34 | 200 | HTML 5, No Lang |
4388 | crbug.com | 5203 | 5.34 | 200 | HTML 5, English |
4389 | fosdem.org | 5205 | 5.34 | 200 | HTML 5, English |
4390 | publons.com | 5206 | 5.34 | 200 | HTML 5, English |
4391 | stltoday.com | 5207 | 5.34 | 200 | HTML 5, English |
4392 | 8x8.com | 5208 | 5.34 | 200 | HTML 5, English |
4393 | ready.gov | 5209 | 5.34 | 200 | HTML 5, English |
4394 | hup.harvard.edu | 5210 | 5.34 | 200 | HTML 5, English |
4395 | pushover.net | 5211 | 5.34 | 200 | HTML 5, English |
4396 | ncr.com | 5212 | 5.34 | 200 | HTML 5, No Lang |
4397 | events.ccc.de | 5213 | 5.34 | 200 | HTML 5 |
4398 | deque.com | 5214 | 5.34 | 200 | HTML 5, English |
4399 | missoulian.com | 5215 | 5.34 | 200 | HTML 5, English |
4400 | ilgiornale.it | 5216 | 5.34 | 200 | HTML 5 |
Data from: Open PageRank