Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
18201 | livescore.com | 21211 | 4.87 | 200 | HTML 5, English |
18202 | ruttl.com | 21212 | 4.87 | 200 | HTML 5, English |
18203 | faberge.com | 21213 | 4.87 | 200 | HTML 5, English |
18204 | ndpr.nd.edu | 21214 | 4.87 | 200 | HTML 5, English |
18205 | maxrealestateexposure.com | 21215 | 4.87 | 200 | HTML 5, English |
18206 | masshist.org | 21216 | 4.87 | 200 | HTML 5, No Lang |
18207 | us19.campaign-archive.com | 21217 | 4.87 | 200 | No Lang |
18208 | npmtrends.com | 21218 | 4.87 | 200 | HTML 5, English |
18209 | docraptor.com | 21219 | 4.87 | 200 | HTML 5, English |
18210 | mintpressnews.com | 21220 | 4.87 | 200 | HTML 5, English |
18211 | velvetropes.com | 21221 | 4.87 | 200 | HTML 5, English |
18212 | cypherpunks.ca | 21222 | 4.87 | 200 | No Lang |
18213 | classifiedads.com | 21223 | 4.87 | 200 | HTML 5, No Lang |
18214 | isic.org | 21224 | 4.87 | 200 | HTML 5, English |
18215 | barnfinds.com | 21225 | 4.87 | 200 | English, Transitional |
18216 | informatik.uni-leipzig.de | 21226 | 4.87 | 200 | HTML 5 |
18217 | wwwn.cdc.gov | 21227 | 4.87 | 200 | No Lang |
18218 | airvistara.com | 21229 | 4.87 | 200 | HTML 5, English |
18219 | premiumtimesng.com | 21230 | 4.87 | 200 | HTML 5, English |
18220 | aldiko.com | 21231 | 4.87 | 200 | HTML 5, English |
18221 | volleybal.nl | 21232 | 4.87 | 200 | HTML 5 |
18222 | rhg.com | 21233 | 4.87 | 200 | HTML 5, English |
18223 | science20.com | 21234 | 4.87 | 200 | English |
18224 | hypixel.net | 21236 | 4.87 | 200 | HTML 5, English |
18225 | upday.com | 21238 | 4.87 | 200 | HTML 5, English |
18226 | wps.com | 21239 | 4.87 | 200 | HTML 5, English |
18227 | support.brightcove.com | 21240 | 4.87 | 200 | HTML 5, English |
18228 | live.bible.is | 21242 | 4.87 | 200 | HTML 5, No Lang |
18229 | truste.com | 21243 | 4.87 | 200 | HTML 5, English |
18230 | z.umn.edu | 21244 | 4.87 | 200 | HTML 5, English |
18231 | nessy.com | 21245 | 4.87 | 200 | HTML 5, English |
18232 | derrystrabane.com | 21246 | 4.87 | 200 | HTML 5, English |
18233 | codechef.com | 21248 | 4.87 | 200 | HTML 5, English |
18234 | incommon.org | 21250 | 4.87 | 200 | HTML 5, English |
18235 | bikeexif.com | 21251 | 4.87 | 200 | HTML 5, English |
18236 | sso.org.sg | 21253 | 4.87 | 200 | HTML 5, English |
18237 | tennis.com.au | 21254 | 4.87 | 200 | HTML 5, English |
18238 | cov-lineages.org | 21255 | 4.87 | 200 | HTML 5, English |
18239 | multichoice.com | 21256 | 4.87 | 200 | HTML 5, English |
18240 | 15five.com | 21257 | 4.87 | 200 | HTML 5, English |
18241 | sccn.ucsd.edu | 21258 | 4.87 | 200 | HTML 5, English |
18242 | fujifilm.jp | 21259 | 4.87 | 200 | HTML 5 |
18243 | masdearte.com | 21260 | 4.87 | 200 | Strict |
18244 | macquarie.com | 21261 | 4.87 | 200 | HTML 5, English |
18245 | admin.microsoft.com | 21262 | 4.87 | 200 | No Lang |
18246 | consensys.io | 21263 | 4.87 | 200 | HTML 5, English |
18247 | simonsaysstamp.com | 21264 | 4.87 | 200 | HTML 5, English |
18248 | provincia.tn.it | 21265 | 4.87 | 200 | HTML 5 |
18249 | jkrowling.com | 21267 | 4.87 | 200 | HTML 5, English |
18250 | boston.cbslocal.com | 21268 | 4.87 | 200 | HTML 5, English |
18251 | online-literature.com | 21270 | 4.87 | 200 | HTML 5, English |
18252 | edinburghnews.scotsman.com | 21271 | 4.87 | 200 | HTML 5, English |
18253 | microsoftedgeinsider.com | 21272 | 4.87 | 200 | HTML 5, English |
18254 | leprogres.fr | 21274 | 4.87 | 200 | HTML 5 |
18255 | wirexapp.com | 21275 | 4.87 | 200 | HTML 5, English |
18256 | veterans.ny.gov | 21276 | 4.87 | 200 | HTML 5, English |
18257 | electron.atom.io | 21277 | 4.87 | 200 | HTML 5, English |
18258 | blogs.pravda.com.ua | 21278 | 4.87 | 200 | HTML 5, No Lang |
18259 | cruisefever.net | 21280 | 4.87 | 200 | English |
18260 | dsal.uchicago.edu | 21281 | 4.87 | 200 | No Lang, Transitional |
18261 | dsm5.org | 21282 | 4.87 | 200 | HTML 5, English |
18262 | magnite.com | 21283 | 4.87 | 200 | HTML 5, English |
18263 | protocol.ai | 21284 | 4.87 | 200 | HTML 5, English |
18264 | living.corriere.it | 21285 | 4.87 | 200 | HTML 5 |
18265 | g.dev | 21287 | 4.87 | 200 | HTML 5, English |
18266 | tinkerpop.apache.org | 21288 | 4.87 | 200 | No Lang |
18267 | help.cbp.gov | 21290 | 4.87 | 200 | HTML 5, English |
18268 | ftp.ncbi.nlm.nih.gov | 21291 | 4.87 | 200 | No Lang |
18269 | forum.paradoxplaza.com | 21292 | 4.87 | 200 | HTML 5, English |
18270 | mobil.abus.com | 21293 | 4.87 | 200 | HTML 5, English |
18271 | bankmycell.com | 21296 | 4.87 | 200 | HTML 5, English |
18272 | bayut.com | 21297 | 4.87 | 200 | HTML 5, English |
18273 | mozzartsport.com | 21298 | 4.87 | 200 | HTML 5 |
18274 | fromsmash.com | 21299 | 4.87 | 200 | HTML 5, English |
18275 | sparefoot.com | 21300 | 4.87 | 200 | HTML 5, English |
18276 | gop.com | 21302 | 4.87 | 200 | HTML 5, English |
18277 | hoteltonight.com | 21303 | 4.87 | 200 | HTML 5, English |
18278 | yatzer.com | 21304 | 4.87 | 200 | HTML 5, English |
18279 | sahealth.sa.gov.au | 21305 | 4.87 | 200 | HTML 5, English |
18280 | tcelectronic.com | 21306 | 4.87 | 200 | HTML 5, English |
18281 | greece.greekreporter.com | 21307 | 4.87 | 200 | English |
18282 | csq.com | 21308 | 4.87 | 200 | HTML 5, English |
18283 | bdew.de | 21310 | 4.87 | 200 | HTML 5 |
18284 | 1000aircraftphotos.com | 21311 | 4.87 | 200 | No Lang |
18285 | m.jpost.com | 21312 | 4.87 | 200 | HTML 5, English |
18286 | wuppertal.de | 21313 | 4.87 | 200 | HTML 5 |
18287 | uni-hildesheim.de | 21314 | 4.87 | 200 | HTML 5 |
18288 | truecrypt.org | 21315 | 4.87 | 200 | HTML 5, English |
18289 | regionh.dk | 21316 | 4.87 | 200 | HTML 5, No Lang |
18290 | nfsa.gov.au | 21317 | 4.87 | 200 | HTML 5, English |
18291 | gettimely.com | 21318 | 4.87 | 200 | HTML 5, English |
18292 | rock-am-ring.com | 21319 | 4.87 | 200 | HTML 5 |
18293 | stattrek.com | 21320 | 4.87 | 200 | HTML 5, English |
18294 | niaaa.nih.gov | 21322 | 4.87 | 200 | HTML 5, English |
18295 | kyoto-np.co.jp | 21323 | 4.87 | 200 | HTML 5 |
18296 | datanet.co.kr | 21325 | 4.87 | 200 | HTML 5 |
18297 | designforwp.com | 21327 | 4.87 | 200 | HTML 5, English |
18298 | epsrc.ac.uk | 21328 | 4.87 | 200 | HTML 5, English |
18299 | singpost.com | 21329 | 4.87 | 200 | HTML 5, English |
18300 | environment.gov.au | 21330 | 4.87 | 200 | HTML 5, English |
Data from: Open PageRank