Table filled in from the top 10m most popular web-sites.
- List Quality
- This list is prioritized on page rank vs traffic
- May include both inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- This list is prioritized on page rank vs traffic
https://s3-us-west-1.amazonaws.com/umbrella-static/index.html https://www.commoncrawl.org/ https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- Don't get much traffic, but 10M+??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as exepected
- Much larger tables have perfomed much better
- Appears like it is doing a full table-scan
- This database is running in a SQL Server under docker.
- Will load the same dataset into postgres for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of html5, proper html, etc..
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper html lang attr
- Surprising number of domains not using a proper HTML 5 doctype
- The domains are naked without fully qualified with hostname.
- Many domains don't have a dns entry for the naked domain.
- domain.com vs www.domain.com which should redirect.
- Suprising number of SSL errors on the naked domain
- Clients can't connect without dropping ssl verification
- With standard security checks clients will never get a redirect
- domains that redirect behind an invalid ssl cert
- this is super easy to fix/park to handle redirects.
- lost revenue with that much linking to get on this list
Selected Top Domains
skip report
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
10201 | calculator-online.net | 11900 | 5.04 | 200 | HTML 5, English |
10202 | maps.google.com.au | 11901 | 5.04 | 200 | HTML 5, English |
10203 | www2.illinois.gov | 11902 | 5.04 | 200 | HTML 5, English |
10204 | autoprefixer.github.io | 11903 | 5.04 | 200 | English |
10205 | freepik.es | 11904 | 5.04 | 200 | HTML 5 |
10206 | newsdig.tbs.co.jp | 11905 | 5.04 | 200 | HTML 5 |
10207 | afterschoolalliance.org | 11906 | 5.04 | 200 | HTML 5, English |
10208 | docs.unity3d.com | 11908 | 5.04 | 200 | HTML 5, English |
10209 | morguefile.com | 11910 | 5.04 | 200 | HTML 5, English |
10210 | lookingglassfactory.com | 11912 | 5.04 | 200 | HTML 5, English |
10211 | wfaa.com | 11914 | 5.04 | 200 | HTML 5, English |
10212 | reprap.org | 11915 | 5.04 | 200 | No Lang |
10213 | codeforamerica.org | 11917 | 5.04 | 200 | HTML 5, English |
10214 | fleetowner.com | 11918 | 5.04 | 200 | HTML 5, English |
10215 | dolphin-emu.org | 11919 | 5.04 | 200 | HTML 5, English |
10216 | neogaf.com | 11920 | 5.04 | 200 | HTML 5, English |
10217 | sidefx.com | 11921 | 5.04 | 200 | HTML 5, English |
10218 | m.dailyhunt.in | 11922 | 5.04 | 200 | HTML 5, English |
10219 | denic.de | 11923 | 5.04 | 200 | HTML 5 |
10220 | lasvegassun.com | 11924 | 5.04 | 200 | HTML 5, English |
10221 | fbreader.org | 11925 | 5.04 | 200 | HTML 5, English |
10222 | luc.edu | 11926 | 5.04 | 200 | HTML 5, English |
10223 | sintef.no | 11927 | 5.04 | 200 | HTML 5 |
10224 | clipboardjs.com | 11928 | 5.04 | 200 | HTML 5, English |
10225 | trustwave.com | 11929 | 5.04 | 200 | HTML 5, English |
10226 | thunderclap.it | 11930 | 5.04 | 200 | HTML 5, English |
10227 | biz.yelp.com | 11931 | 5.04 | 200 | HTML 5, English |
10228 | cyclos.org | 11932 | 5.04 | 200 | HTML 5, English |
10229 | efta.int | 11933 | 5.04 | 200 | HTML 5, English |
10230 | kone.com | 11934 | 5.04 | 200 | HTML 5, English |
10231 | ofgem.gov.uk | 11935 | 5.04 | 200 | HTML 5, English |
10232 | nsis.sourceforge.net | 11936 | 5.04 | 200 | HTML 5, English |
10233 | barion.com | 11937 | 5.04 | 200 | HTML 5, English |
10234 | leeds.ac.uk | 11938 | 5.04 | 200 | HTML 5, English |
10235 | ics.forth.gr | 11939 | 5.04 | 200 | HTML 5, English |
10236 | sedici.unlp.edu.ar | 11940 | 5.04 | 200 | No Lang, Strict |
10237 | flowingdata.com | 11941 | 5.04 | 200 | HTML 5, English |
10238 | nomadicmatt.com | 11942 | 5.04 | 200 | HTML 5, English |
10239 | quip.com | 11943 | 5.04 | 200 | HTML 5, English |
10240 | homeadvisor.com | 11944 | 5.04 | 200 | HTML 5, English |
10241 | islamweb.net | 11945 | 5.04 | 200 | HTML 5, No Lang |
10242 | jjie.org | 11947 | 5.04 | 200 | HTML 5, English |
10243 | freeagent.com | 11950 | 5.04 | 200 | HTML 5, English |
10244 | cmegroup.com | 11952 | 5.04 | 200 | HTML 5, English |
10245 | unwater.org | 11953 | 5.04 | 200 | HTML 5, English |
10246 | phemex.com | 11954 | 5.04 | 200 | HTML 5, English |
10247 | uswitch.com | 11955 | 5.04 | 200 | HTML 5, English |
10248 | grantland.com | 11957 | 5.04 | 200 | HTML 5, No Lang |
10249 | parismusees.paris.fr | 11958 | 5.04 | 200 | HTML 5 |
10250 | nayuki.io | 11961 | 5.04 | 200 | English |
10251 | d1.awsstatic.com | 11962 | 5.04 | 200 | No Lang |
10252 | astrograph.com | 11964 | 5.04 | 200 | HTML 5, English |
10253 | indiamart.com | 11965 | 5.04 | 200 | HTML 5, English |
10254 | avalon.law.yale.edu | 11966 | 5.04 | 200 | No Lang |
10255 | ipvanish.com | 11967 | 5.04 | 200 | HTML 5, English |
10256 | linz.govt.nz | 11968 | 5.04 | 200 | HTML 5, English |
10257 | gazeteduvar.com.tr | 11969 | 5.04 | 200 | HTML 5 |
10258 | yalemedicine.org | 11972 | 5.04 | 200 | HTML 5, English |
10259 | artincontext.org | 11973 | 5.04 | 200 | HTML 5, English |
10260 | documentfoundation.org | 11974 | 5.04 | 200 | HTML 5, English |
10261 | lovecrafts.com | 11975 | 5.04 | 200 | HTML 5, English |
10262 | px.ads.linkedin.com | 11977 | 5.04 | 200 | No Lang |
10263 | bloggingpro.com | 11978 | 5.04 | 200 | HTML 5, English |
10264 | feeld.co | 11980 | 5.04 | 200 | HTML 5, English |
10265 | conference-board.org | 11981 | 5.04 | 200 | No Lang |
10266 | catdir.loc.gov | 11982 | 5.04 | 200 | No Lang |
10267 | itic.org | 11983 | 5.04 | 200 | HTML 5, English |
10268 | e-flux.com | 11984 | 5.04 | 200 | HTML 5, English |
10269 | krakow.pl | 11985 | 5.04 | 200 | HTML 5 |
10270 | blog.twitch.tv | 11986 | 5.04 | 200 | HTML 5, English |
10271 | rosenfeldmedia.com | 11987 | 5.04 | 200 | HTML 5, English |
10272 | nibib.nih.gov | 11988 | 5.04 | 200 | HTML 5, English |
10273 | scala-lang.org | 11989 | 5.04 | 200 | HTML 5, No Lang |
10274 | alzheimers.org.uk | 11990 | 5.04 | 200 | HTML 5, English |
10275 | visibleearth.nasa.gov | 11992 | 5.04 | 200 | HTML 5, No Lang |
10276 | malaga.es | 11993 | 5.04 | 200 | HTML 5 |
10277 | isbn-international.org | 11994 | 5.04 | 200 | HTML 5, English |
10278 | guardian.ng | 11995 | 5.04 | 200 | HTML 5, English |
10279 | iris.ucl.ac.uk | 11996 | 5.04 | 200 | HTML 5, No Lang |
10280 | skylum.com | 11997 | 5.04 | 200 | HTML 5, English |
10281 | ktar.com | 11998 | 5.04 | 200 | HTML 5, English |
10282 | nailsmag.com | 11999 | 5.04 | 200 | HTML 5, English |
10283 | chinaz.com | 12000 | 5.04 | 200 | HTML 5, No Lang |
10284 | doc.qt.io | 12002 | 5.04 | 200 | English |
10285 | core77.com | 12003 | 5.04 | 200 | HTML 5, English |
10286 | de.trustpilot.com | 12004 | 5.04 | 200 | HTML 5 |
10287 | coventrytelegraph.net | 12005 | 5.04 | 200 | HTML 5, English |
10288 | boerse-frankfurt.de | 12006 | 5.04 | 200 | HTML 5 |
10289 | store.sony.com | 12007 | 5.04 | 200 | HTML 5, English |
10290 | biometricupdate.com | 12008 | 5.04 | 200 | HTML 5, English |
10291 | turismo.gal | 12009 | 5.04 | 200 | HTML 5, English |
10292 | itau.com.br | 12010 | 5.04 | 200 | HTML 5 |
10293 | blog.akismet.com | 12011 | 5.04 | 200 | HTML 5, English |
10294 | lifesize.com | 12012 | 5.04 | 200 | HTML 5, English |
10295 | engineeringtoolbox.com | 12013 | 5.04 | 200 | HTML 5, English |
10296 | rockwellautomation.com | 12014 | 5.04 | 200 | HTML 5, No Lang |
10297 | modeanalytics.com | 12016 | 5.04 | 200 | HTML 5, English |
10298 | giga.de | 12018 | 5.04 | 200 | HTML 5 |
10299 | thangs.com | 12019 | 5.04 | 200 | HTML 5, English |
10300 | lavozdegalicia.es | 12021 | 5.04 | 200 | HTML 5 |
Data from: Open PageRank