Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
19901 | simplepie.org | 23220 | 4.84 | 200 | English, Transitional |
19902 | forums.aws.amazon.com | 23221 | 4.84 | 200 | HTML 5, English |
19903 | russialist.org | 23222 | 4.84 | 200 | HTML 5, No Lang |
19904 | indiedb.com | 23223 | 4.84 | 200 | HTML 5, English |
19905 | extra.globo.com | 23224 | 4.84 | 200 | HTML 5 |
19906 | photober.com | 23226 | 4.84 | 200 | HTML 5, English |
19907 | sierra.com | 23227 | 4.84 | 200 | HTML 5, No Lang |
19908 | wildlifeacoustics.com | 23228 | 4.84 | 200 | HTML 5, English |
19909 | ysocorp.com | 23229 | 4.84 | 200 | HTML 5, English |
19910 | cogitatiopress.com | 23230 | 4.84 | 200 | HTML 5, English |
19911 | vmobil.at | 23231 | 4.84 | 200 | HTML 5 |
19912 | ezviz.com | 23232 | 4.84 | 200 | HTML 5, English |
19913 | refactoring.com | 23233 | 4.84 | 200 | No Lang, Transitional |
19914 | owlcation.com | 23234 | 4.84 | 200 | HTML 5, English |
19915 | partner.steamgames.com | 23235 | 4.84 | 200 | HTML 5, English |
19916 | elfontheshelf.com | 23236 | 4.84 | 200 | HTML 5, English |
19917 | bmwblog.com | 23237 | 4.84 | 200 | HTML 5, English |
19918 | fnal.gov | 23238 | 4.84 | 200 | HTML 5, English |
19919 | pps.org | 23239 | 4.84 | 200 | HTML 5, No Lang |
19920 | engineering.linkedin.com | 23240 | 4.84 | 200 | HTML 5, English |
19921 | ccrjustice.org | 23241 | 4.84 | 200 | HTML 5, English |
19922 | usfa.fema.gov | 23242 | 4.84 | 200 | HTML 5, English |
19923 | waappitalk.com | 23243 | 4.84 | 200 | HTML 5, English |
19924 | asyncapi.com | 23244 | 4.84 | 200 | HTML 5, English |
19925 | shopeo.cn | 23245 | 4.84 | 200 | HTML 5 |
19926 | amoerboristeria.com | 23246 | 4.84 | 200 | HTML 5 |
19927 | aberturasldo.com.ar | 23247 | 4.84 | 200 | HTML 5 |
19928 | timshedor.com | 23248 | 4.84 | 200 | HTML 5, English |
19929 | rom.on.ca | 23249 | 4.84 | 200 | HTML 5, English |
19930 | honeycomb.io | 23250 | 4.84 | 200 | HTML 5, English |
19931 | drum.lib.umd.edu | 23251 | 4.84 | 200 | HTML 5, English |
19932 | math.princeton.edu | 23252 | 4.84 | 200 | HTML 5, English |
19933 | doh.sd.gov | 23253 | 4.84 | 200 | HTML 5, English |
19934 | ru.freepik.com | 23255 | 4.84 | 200 | HTML 5 |
19935 | lorealparisusa.com | 23256 | 4.84 | 200 | HTML 5, English |
19936 | personal.ntu.edu.sg | 23258 | 4.84 | 200 | HTML 5, No Lang |
19937 | constructiondive.com | 23262 | 4.84 | 200 | HTML 5, English |
19938 | blocksandfiles.com | 23264 | 4.84 | 200 | English |
19939 | lists.gnupg.org | 23265 | 4.84 | 200 | English, Strict |
19940 | kleinerperkins.com | 23267 | 4.84 | 200 | HTML 5, English |
19941 | theness.com | 23268 | 4.84 | 200 | No Lang, Transitional |
19942 | ehjournal.biomedcentral.com | 23270 | 4.84 | 200 | HTML 5, English |
19943 | booklog.jp | 23271 | 4.84 | 200 | HTML 5 |
19944 | galaxyzoo.org | 23273 | 4.84 | 200 | HTML 5, No Lang |
19945 | e-estonia.com | 23274 | 4.84 | 200 | HTML 5 |
19946 | rgs.org | 23276 | 4.84 | 200 | HTML 5, English |
19947 | cdn.statically.io | 23277 | 4.84 | 200 | HTML 5, English |
19948 | fsharp.org | 23278 | 4.84 | 200 | HTML 5, English |
19949 | iloveimg.com | 23280 | 4.84 | 200 | HTML 5, English |
19950 | thatskygame.com | 23281 | 4.84 | 200 | HTML 5, English |
19951 | farodevigo.es | 23283 | 4.84 | 200 | HTML 5 |
19952 | citizen-times.com | 23284 | 4.84 | 200 | HTML 5, English |
19953 | stampcommunity.org | 23285 | 4.84 | 200 | HTML 5, No Lang |
19954 | bumrungrad.com | 23286 | 4.84 | 200 | HTML 5, No Lang |
19955 | broadcastnow.co.uk | 23287 | 4.84 | 200 | HTML 5, English |
19956 | forms.clickup.com | 23288 | 4.84 | 200 | HTML 5, English |
19957 | shamusyoung.com | 23290 | 4.84 | 200 | No Lang |
19958 | fussball.de | 23291 | 4.84 | 200 | HTML 5 |
19959 | mettl.com | 23295 | 4.84 | 200 | HTML 5, English |
19960 | w.wiki | 23296 | 4.84 | 200 | HTML 5, No Lang |
19961 | theccc.org.uk | 23297 | 4.84 | 200 | HTML 5, English |
19962 | restfulapi.net | 23298 | 4.84 | 200 | HTML 5, English |
19963 | poolparty.biz | 23300 | 4.84 | 200 | HTML 5, English |
19964 | barclays.co.uk | 23303 | 4.84 | 200 | HTML 5, English |
19965 | money.yahoo.com | 23304 | 4.84 | 200 | HTML 5, English |
19966 | nisra.gov.uk | 23305 | 4.84 | 200 | HTML 5, English |
19967 | rote-liste.de | 23306 | 4.84 | 200 | HTML 5, English |
19968 | dainst.org | 23307 | 4.84 | 200 | HTML 5 |
19969 | eventmobi.com | 23308 | 4.84 | 200 | HTML 5, English |
19970 | wdfw.wa.gov | 23309 | 4.84 | 200 | HTML 5, English |
19971 | frommers.com | 23310 | 4.84 | 200 | HTML 5, English |
19972 | dave.cheney.net | 23311 | 4.84 | 200 | HTML 5, English |
19973 | yuiblog.com | 23312 | 4.84 | 200 | HTML 5, English |
19974 | spring.org.uk | 23313 | 4.84 | 200 | HTML 5, English |
19975 | forum.kerbalspaceprogram.com | 23314 | 4.84 | 200 | HTML 5, English |
19976 | learnreligions.com | 23315 | 4.84 | 200 | HTML 5, English |
19977 | nova.edu | 23316 | 4.84 | 200 | HTML 5, English |
19978 | ai.baidu.com | 23317 | 4.84 | 200 | HTML 5 |
19979 | quut.com | 23318 | 4.84 | 200 | No Lang, Transitional |
19980 | codeboxr.com | 23320 | 4.84 | 200 | HTML 5, English |
19981 | focusfeatures.com | 23321 | 4.84 | 200 | HTML 5, English |
19982 | licensebuttons.net | 23322 | 4.84 | 200 | HTML 5, English |
19983 | tuya.com | 23323 | 4.84 | 200 | HTML 5, English |
19984 | go.yandex | 23324 | 4.84 | 200 | HTML 5 |
19985 | ajax.systems | 23325 | 4.84 | 200 | HTML 5, English |
19986 | agenciasinc.es | 23326 | 4.84 | 200 | HTML 5 |
19987 | vvt.at | 23328 | 4.84 | 200 | HTML 5 |
19988 | texasobserver.org | 23329 | 4.84 | 200 | HTML 5, English |
19989 | dhhr.wv.gov | 23330 | 4.84 | 200 | English, Transitional |
19990 | uk.trustpilot.com | 23331 | 4.84 | 200 | HTML 5, English |
19991 | sozialversicherung.at | 23332 | 4.84 | 200 | HTML 5 |
19992 | till.im | 23333 | 4.84 | 200 | HTML 5, English |
19993 | dubrox.github.io | 23334 | 4.84 | 200 | HTML 5, English |
19994 | freiburger-nachrichten.ch | 23335 | 4.84 | 200 | HTML 5, No Lang |
19995 | github.khronos.org | 23336 | 4.84 | 200 | HTML 5, No Lang |
19996 | opendata.aragon.es | 23337 | 4.84 | 200 | HTML 5 |
19997 | bnr.bg | 23339 | 4.84 | 200 | HTML 5 |
19998 | lci.fr | 23340 | 4.84 | 200 | HTML 5 |
19999 | ddot.dc.gov | 23341 | 4.84 | 200 | English |
20000 | wider.unu.edu | 23343 | 4.84 | 200 | HTML 5, English |
Data from: Open PageRank