Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18201 | ruttl.com | 21212 | 4.87 | 200 | HTML 5, English |
18202 | faberge.com | 21213 | 4.87 | 200 | HTML 5, English |
18203 | ndpr.nd.edu | 21214 | 4.87 | 200 | HTML 5, English |
18204 | maxrealestateexposure.com | 21215 | 4.87 | 200 | HTML 5, English |
18205 | masshist.org | 21216 | 4.87 | 200 | HTML 5, No Lang |
18206 | us19.campaign-archive.com | 21217 | 4.87 | 200 | No Lang |
18207 | npmtrends.com | 21218 | 4.87 | 200 | HTML 5, English |
18208 | docraptor.com | 21219 | 4.87 | 200 | HTML 5, English |
18209 | mintpressnews.com | 21220 | 4.87 | 200 | HTML 5, English |
18210 | velvetropes.com | 21221 | 4.87 | 200 | HTML 5, English |
18211 | cypherpunks.ca | 21222 | 4.87 | 200 | No Lang |
18212 | classifiedads.com | 21223 | 4.87 | 200 | HTML 5, No Lang |
18213 | isic.org | 21224 | 4.87 | 200 | HTML 5, English |
18214 | barnfinds.com | 21225 | 4.87 | 200 | English, Transitional |
18215 | informatik.uni-leipzig.de | 21226 | 4.87 | 200 | HTML 5 |
18216 | wwwn.cdc.gov | 21227 | 4.87 | 200 | No Lang |
18217 | airvistara.com | 21229 | 4.87 | 200 | HTML 5, English |
18218 | premiumtimesng.com | 21230 | 4.87 | 200 | HTML 5, English |
18219 | aldiko.com | 21231 | 4.87 | 200 | HTML 5, English |
18220 | volleybal.nl | 21232 | 4.87 | 200 | HTML 5 |
18221 | rhg.com | 21233 | 4.87 | 200 | HTML 5, English |
18222 | science20.com | 21234 | 4.87 | 200 | English |
18223 | hypixel.net | 21236 | 4.87 | 200 | HTML 5, English |
18224 | upday.com | 21238 | 4.87 | 200 | HTML 5, English |
18225 | wps.com | 21239 | 4.87 | 200 | HTML 5, English |
18226 | support.brightcove.com | 21240 | 4.87 | 200 | HTML 5, English |
18227 | live.bible.is | 21242 | 4.87 | 200 | HTML 5, No Lang |
18228 | truste.com | 21243 | 4.87 | 200 | HTML 5, English |
18229 | z.umn.edu | 21244 | 4.87 | 200 | HTML 5, English |
18230 | nessy.com | 21245 | 4.87 | 200 | HTML 5, English |
18231 | derrystrabane.com | 21246 | 4.87 | 200 | HTML 5, English |
18232 | codechef.com | 21248 | 4.87 | 200 | HTML 5, English |
18233 | incommon.org | 21250 | 4.87 | 200 | HTML 5, English |
18234 | bikeexif.com | 21251 | 4.87 | 200 | HTML 5, English |
18235 | sso.org.sg | 21253 | 4.87 | 200 | HTML 5, English |
18236 | tennis.com.au | 21254 | 4.87 | 200 | HTML 5, English |
18237 | cov-lineages.org | 21255 | 4.87 | 200 | HTML 5, English |
18238 | multichoice.com | 21256 | 4.87 | 200 | HTML 5, English |
18239 | 15five.com | 21257 | 4.87 | 200 | HTML 5, English |
18240 | sccn.ucsd.edu | 21258 | 4.87 | 200 | HTML 5, English |
18241 | fujifilm.jp | 21259 | 4.87 | 200 | HTML 5 |
18242 | masdearte.com | 21260 | 4.87 | 200 | Strict |
18243 | macquarie.com | 21261 | 4.87 | 200 | HTML 5, English |
18244 | admin.microsoft.com | 21262 | 4.87 | 200 | No Lang |
18245 | consensys.io | 21263 | 4.87 | 200 | HTML 5, English |
18246 | simonsaysstamp.com | 21264 | 4.87 | 200 | HTML 5, English |
18247 | provincia.tn.it | 21265 | 4.87 | 200 | HTML 5 |
18248 | jkrowling.com | 21267 | 4.87 | 200 | HTML 5, English |
18249 | boston.cbslocal.com | 21268 | 4.87 | 200 | HTML 5, English |
18250 | online-literature.com | 21270 | 4.87 | 200 | HTML 5, English |
18251 | edinburghnews.scotsman.com | 21271 | 4.87 | 200 | HTML 5, English |
18252 | microsoftedgeinsider.com | 21272 | 4.87 | 200 | HTML 5, English |
18253 | leprogres.fr | 21274 | 4.87 | 200 | HTML 5 |
18254 | wirexapp.com | 21275 | 4.87 | 200 | HTML 5, English |
18255 | veterans.ny.gov | 21276 | 4.87 | 200 | HTML 5, English |
18256 | electron.atom.io | 21277 | 4.87 | 200 | HTML 5, English |
18257 | blogs.pravda.com.ua | 21278 | 4.87 | 200 | HTML 5, No Lang |
18258 | cruisefever.net | 21280 | 4.87 | 200 | English |
18259 | dsal.uchicago.edu | 21281 | 4.87 | 200 | No Lang, Transitional |
18260 | dsm5.org | 21282 | 4.87 | 200 | HTML 5, English |
18261 | magnite.com | 21283 | 4.87 | 200 | HTML 5, English |
18262 | protocol.ai | 21284 | 4.87 | 200 | HTML 5, English |
18263 | living.corriere.it | 21285 | 4.87 | 200 | HTML 5 |
18264 | g.dev | 21287 | 4.87 | 200 | HTML 5, English |
18265 | tinkerpop.apache.org | 21288 | 4.87 | 200 | No Lang |
18266 | help.cbp.gov | 21290 | 4.87 | 200 | HTML 5, English |
18267 | ftp.ncbi.nlm.nih.gov | 21291 | 4.87 | 200 | No Lang |
18268 | forum.paradoxplaza.com | 21292 | 4.87 | 200 | HTML 5, English |
18269 | mobil.abus.com | 21293 | 4.87 | 200 | HTML 5, English |
18270 | bankmycell.com | 21296 | 4.87 | 200 | HTML 5, English |
18271 | bayut.com | 21297 | 4.87 | 200 | HTML 5, English |
18272 | mozzartsport.com | 21298 | 4.87 | 200 | HTML 5 |
18273 | fromsmash.com | 21299 | 4.87 | 200 | HTML 5, English |
18274 | sparefoot.com | 21300 | 4.87 | 200 | HTML 5, English |
18275 | gop.com | 21302 | 4.87 | 200 | HTML 5, English |
18276 | hoteltonight.com | 21303 | 4.87 | 200 | HTML 5, English |
18277 | yatzer.com | 21304 | 4.87 | 200 | HTML 5, English |
18278 | sahealth.sa.gov.au | 21305 | 4.87 | 200 | HTML 5, English |
18279 | tcelectronic.com | 21306 | 4.87 | 200 | HTML 5, English |
18280 | greece.greekreporter.com | 21307 | 4.87 | 200 | English |
18281 | csq.com | 21308 | 4.87 | 200 | HTML 5, English |
18282 | bdew.de | 21310 | 4.87 | 200 | HTML 5 |
18283 | 1000aircraftphotos.com | 21311 | 4.87 | 200 | No Lang |
18284 | m.jpost.com | 21312 | 4.87 | 200 | HTML 5, English |
18285 | wuppertal.de | 21313 | 4.87 | 200 | HTML 5 |
18286 | uni-hildesheim.de | 21314 | 4.87 | 200 | HTML 5 |
18287 | truecrypt.org | 21315 | 4.87 | 200 | HTML 5, English |
18288 | regionh.dk | 21316 | 4.87 | 200 | HTML 5, No Lang |
18289 | nfsa.gov.au | 21317 | 4.87 | 200 | HTML 5, English |
18290 | gettimely.com | 21318 | 4.87 | 200 | HTML 5, English |
18291 | rock-am-ring.com | 21319 | 4.87 | 200 | HTML 5 |
18292 | stattrek.com | 21320 | 4.87 | 200 | HTML 5, English |
18293 | niaaa.nih.gov | 21322 | 4.87 | 200 | HTML 5, English |
18294 | kyoto-np.co.jp | 21323 | 4.87 | 200 | HTML 5 |
18295 | datanet.co.kr | 21325 | 4.87 | 200 | HTML 5 |
18296 | designforwp.com | 21327 | 4.87 | 200 | HTML 5, English |
18297 | epsrc.ac.uk | 21328 | 4.87 | 200 | HTML 5, English |
18298 | singpost.com | 21329 | 4.87 | 200 | HTML 5, English |
18299 | environment.gov.au | 21330 | 4.87 | 200 | HTML 5, English |
18300 | formpl.us | 21331 | 4.87 | 200 | HTML 5, English |
Data from: Open PageRank