Setting up a MacBook Pro for Development

As I’m building a new startup, I’ve been setting up several MacBooks and helping engineers with their own setups. Of course, we all have our quirks when configuring our machines, but there is a base that would be nice to share and keep as standard as possible. These are my (hopefully sane) defaults.

Should a backend developer learn JavaScript?

I’ve been working with statically typed languages for several years now as a C# / Java / Scala developer, and I like the safety and guarantees of type checking. The whole JVM ecosystem has also aged well, with great build tools, libraries and lots of experience from the community. But I cannot ignore all the buzz around JavaScript, and I don’t want to be prejudiced about something I haven’t worked with directly for several years.

This is a very overdue blog post that I had in draft for almost a year, but I believe it is still very relevant.

A data science toolkit inside a Docker image: build it once, run everywhere

If you have never heard of Jupyter Notebook, I highly recommend you check it out. It has been my primary platform for building reports and data-driven case studies. In this post I’d like to show how I create a simple, isolated environment with a Bash script and Docker to run JupyterLab. Recently Jupyter Notebook received a major overhaul and became JupyterLab. It is currently in beta, but the new platform looks fresh and very powerful.

Reasons to fall in love with Postgres

I’ve been working in the analytics/big data field for 10+ years, and during this time I’ve worked mostly with MySQL, MongoDB, Redis and Cassandra. Just a couple of years ago I started to really pay attention to Postgres, and my regret is not getting into it earlier… In this post I try to enumerate a few features I’m using and why I think you should try it too, before jumping into the architectural and operational complexity of multiple NoSQL databases.

Which is the best tool for copying a large directory tree locally?

Recently we had to move a full Cassandra backup to another cluster of machines (another datacenter, in Cassandra’s jargon). Although this can be achieved using DC replication, we opted for a more conservative approach that neither changes production configurations nor increases production load due to data streaming. This post is a quick comparison to find out which tool performs best for copying a large directory tree locally.

The Data

One of our Cassandra clusters contains 12 nodes; each node has 532 GB of data distributed among 1,753,200 files (the /var/lib/cassandra folder).

Installing Datastax Analytics (Cassandra and Spark) with Azure Templates

Last week I had the opportunity to share Socialmetrix’s experience installing and configuring Datastax Analytics clusters on Azure. Datastax offers a commercial solution as a bundle that integrates Cassandra, Spark and Solr. The talks were given at the Argentina Big Data Meetup, hosted by Jampp, and the Nardoz Meetup, hosted by Medallia.