Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
13901 | education.github.com | 16199 | 4.95 | 200 | HTML 5, English |
13902 | epicbrowser.com | 16201 | 4.95 | 200 | HTML 5, English |
13903 | identi.ca | 16203 | 4.95 | 200 | HTML 5, English |
13904 | lydia-app.com | 16205 | 4.95 | 200 | HTML 5 |
13905 | dot.ca.gov | 16206 | 4.95 | 200 | HTML 5, English |
13906 | getadmiral.com | 16208 | 4.95 | 200 | HTML 5, English |
13907 | people.umass.edu | 16209 | 4.95 | 200 | No Lang |
13908 | undark.org | 16210 | 4.95 | 200 | HTML 5, English |
13909 | duesseldorf.de | 16211 | 4.95 | 200 | HTML 5 |
13910 | cooksillustrated.com | 16212 | 4.95 | 200 | HTML 5, English |
13911 | buap.mx | 16213 | 4.95 | 200 | HTML 5 |
13912 | blogs.getty.edu | 16214 | 4.95 | 200 | HTML 5, English |
13913 | arb.ca.gov | 16215 | 4.95 | 200 | HTML 5, English |
13914 | fau.de | 16217 | 4.95 | 200 | HTML 5 |
13915 | all3dp.com | 16218 | 4.95 | 200 | HTML 5, English |
13916 | wtatennis.com | 16219 | 4.95 | 200 | HTML 5, English |
13917 | sweetcode.com | 16220 | 4.95 | 200 | HTML 5, English |
13918 | thehighline.org | 16221 | 4.95 | 200 | HTML 5, English |
13919 | berkshirehathaway.com | 16222 | 4.95 | 200 | No Lang |
13920 | games-workshop.com | 16223 | 4.95 | 200 | HTML 5, English |
13921 | us.wordcamp.org | 16224 | 4.95 | 200 | HTML 5, English |
13922 | ct24.cz | 16226 | 4.95 | 200 | HTML 5 |
13923 | people.cs.umass.edu | 16227 | 4.95 | 200 | HTML 5, English |
13924 | zimbabweflora.co.zw | 16228 | 4.95 | 200 | HTML 5, English |
13925 | openpsychometrics.org | 16229 | 4.95 | 200 | No Lang |
13926 | wbtw.com | 16230 | 4.95 | 200 | HTML 5, English |
13927 | wafb.com | 16231 | 4.95 | 200 | HTML 5, English |
13928 | saratogian.com | 16232 | 4.95 | 200 | HTML 5, English |
13929 | andersnoren.se | 16234 | 4.95 | 200 | HTML 5, English |
13930 | biblica.com | 16235 | 4.95 | 200 | HTML 5, English |
13931 | v2.wp-api.org | 16236 | 4.95 | 200 | HTML 5, English |
13932 | bnf.fr | 16237 | 4.95 | 200 | HTML 5 |
13933 | sgp.fas.org | 16240 | 4.95 | 200 | No Lang |
13934 | bjp.org | 16241 | 4.95 | 200 | HTML 5, English |
13935 | thetruthaboutcars.com | 16242 | 4.95 | 200 | HTML 5, English |
13936 | etsu.edu | 16244 | 4.95 | 200 | HTML 5, English |
13937 | puffingbilly.com.au | 16245 | 4.95 | 200 | HTML 5, English |
13938 | glowing.com | 16246 | 4.95 | 200 | HTML 5, English |
13939 | telex.hu | 16247 | 4.95 | 200 | HTML 5 |
13940 | atlantablackstar.com | 16248 | 4.95 | 200 | HTML 5, English |
13941 | metrorio.com.br | 16250 | 4.95 | 200 | HTML 5 |
13942 | pomodorotechnique.com | 16251 | 4.95 | 200 | HTML 5, English |
13943 | claudia.abril.com.br | 16252 | 4.95 | 200 | HTML 5 |
13944 | openbookpublishers.com | 16253 | 4.95 | 200 | HTML 5, English |
13945 | fiercebiotech.com | 16254 | 4.95 | 200 | HTML 5, English |
13946 | woocommerce.github.io | 16255 | 4.95 | 200 | HTML 5, English |
13947 | stuff.mit.edu | 16256 | 4.95 | 200 | No Lang |
13948 | app.com | 16257 | 4.95 | 200 | HTML 5, English |
13949 | question2answer.org | 16258 | 4.95 | 200 | HTML 5, English |
13950 | mec.ca | 16259 | 4.95 | 200 | HTML 5, English |
13951 | momjunction.com | 16260 | 4.95 | 200 | HTML 5, English |
13952 | e22.com | 16261 | 4.95 | 200 | HTML 5, No Lang |
13953 | wso2.com | 16262 | 4.95 | 200 | HTML 5, English |
13954 | clipartof.com | 16263 | 4.95 | 200 | HTML 5, English |
13955 | fingfx.thomsonreuters.com | 16264 | 4.95 | 200 | HTML 5, English |
13956 | wiki.openstack.org | 16265 | 4.95 | 200 | HTML 5, English |
13957 | journals.iucr.org | 16266 | 4.95 | 200 | No Lang, Transitional |
13958 | cupshe.com | 16267 | 4.95 | 200 | HTML 5, English |
13959 | research.checkpoint.com | 16268 | 4.95 | 200 | HTML 5, English |
13960 | go.hotmart.com | 16270 | 4.95 | 200 | HTML 5, English |
13961 | npic.orst.edu | 16271 | 4.95 | 200 | HTML 5, English |
13962 | kyliecosmetics.com | 16272 | 4.95 | 200 | HTML 5, English |
13963 | okcoin.com | 16273 | 4.95 | 200 | HTML 5, English |
13964 | shsu.edu | 16274 | 4.95 | 200 | HTML 5, English |
13965 | crypto.stackexchange.com | 16275 | 4.95 | 200 | HTML 5, English |
13966 | entertainment.time.com | 16276 | 4.95 | 200 | HTML 5, No Lang |
13967 | inst.eecs.berkeley.edu | 16277 | 4.95 | 200 | English |
13968 | cdn.sanity.io | 16278 | 4.95 | 200 | No Lang |
13969 | naldc.nal.usda.gov | 16279 | 4.95 | 200 | HTML 5, English |
13970 | isni.org | 16280 | 4.95 | 200 | HTML 5, English |
13971 | rivm.nl | 16281 | 4.95 | 200 | HTML 5 |
13972 | cipd.co.uk | 16282 | 4.95 | 200 | HTML 5, English |
13973 | blog.thunderbird.net | 16283 | 4.95 | 200 | HTML 5, English |
13974 | fcstpauli.com | 16285 | 4.95 | 200 | HTML 5 |
13975 | hk.linkedin.com | 16286 | 4.95 | 200 | HTML 5, English |
13976 | fileinfo.com | 16287 | 4.95 | 200 | HTML 5, English |
13977 | bsg.ox.ac.uk | 16288 | 4.95 | 200 | HTML 5, English |
13978 | wmich.edu | 16289 | 4.95 | 200 | HTML 5, English |
13979 | pictory.ai | 16290 | 4.95 | 200 | HTML 5, English |
13980 | snipboard.io | 16291 | 4.95 | 200 | HTML 5, English |
13981 | littlesunnykitchen.com | 16292 | 4.95 | 200 | HTML 5, English |
13982 | anwb.nl | 16293 | 4.95 | 200 | HTML 5 |
13983 | jeremykun.com | 16294 | 4.95 | 200 | HTML 5, English |
13984 | orgmode.org | 16295 | 4.95 | 200 | HTML 5, English |
13985 | avaibook.com | 16296 | 4.95 | 200 | HTML 5 |
13986 | citibikenyc.com | 16297 | 4.95 | 200 | HTML 5, English |
13987 | clearscope.io | 16298 | 4.95 | 200 | HTML 5, English |
13988 | adtraction.com | 16299 | 4.95 | 200 | HTML 5, English |
13989 | pro.regiondo.com | 16300 | 4.95 | 200 | HTML 5, English |
13990 | wcfia.harvard.edu | 16302 | 4.95 | 200 | HTML 5, English |
13991 | serpentinegalleries.org | 16303 | 4.95 | 200 | No Lang |
13992 | web.engr.oregonstate.edu | 16304 | 4.95 | 200 | No Lang, Transitional |
13993 | ilevia.fr | 16305 | 4.95 | 200 | HTML 5 |
13994 | apps.lucidcentral.org | 16307 | 4.95 | 200 | HTML 5, English |
13995 | go.fiverr.com | 16308 | 4.95 | 200 | No Lang |
13996 | material.google.com | 16309 | 4.95 | 200 | HTML 5, English |
13997 | eo.wikipedia.org | 16310 | 4.95 | 200 | HTML 5, No Lang |
13998 | ftc.go.kr | 16311 | 4.95 | 200 | No Lang |
13999 | 13abc.com | 16312 | 4.95 | 200 | HTML 5, English |
14000 | labs.google.com | 16313 | 4.95 | 200 | HTML 5, English |
Data from: Open PageRank