I recently gave a talk at PyCon Canada 2015 in Toronto about the Epic NHL goal celebration hack with a hue light show and real-time machine learning blog post.
In a hurry? Go straight to the graphs. The dataset and notebook detailing how this was done are available in the companion repository. Two weeks ago was Back To The Future Day. October 21st, 2015 is the day Marty and Doc Brown travel to at the beginning of the second movie. The future is now the past. There were worldwide celebrations and jokes, from the … Continue reading Actually, Marty didn’t go Back To The Future: Graphing the train sequence of BTTF3
The 2015 Canadian federal election is in its final stretch and college and I thought it would be a great opportunity to collect some data and do some machine learning. Citizen data science in action! We looked at the press releases of non-regional Canadian federal political parties using Datacratic’s Machine Learning Database (MLDB). The image below is a map with 620 dots each representing one … Continue reading Mapping Press Releases in the 2015 Canadian Federal Election
See media coverage of this blog post. In Montréal this time of year, the city literally stops and everyone starts talking, thinking and dreaming about a single thing: the Stanley Cup Playoffs. Even most of those who don’t normally care the least bit about hockey transform into die hard fans of the Montréal Canadiens, or the Habs like we also call them. Below is a … Continue reading Hacking an epic NHL goal celebration with a hue light show and real-time machine learning
Je viens de publier un article, MapReduce avec parallel, cat et une redirection, sur le blogue de Datacratic.
I’ve recently learned a couple of neat tricks to process large amounts of text files more efficiently from my new co-worker @nicolaskruchten. Our use-case is efficiently going through tens of gigabytes of logs to extract specific lines and do some operation on them. Here are a couple of things we’ve done to speed things up. Keep everything gziped Often, the bottleneck will be IO. This … Continue reading Efficient log processing
Getting a 64-bit installation of Python with scientific packages on our dear Windows isn’t as simple as running an apt-get or port command. There is an official 64-bit Python build available but extensions like numpy, scipy or matplotlib only have official 32-bit builds. There are commercial distributions such as Enthought that offer all the packages built in 64-bit but at around 200$ per license, this … Continue reading 64-bit Scientific Python on Windows
I was thrilled to attend the Boston Music Hackday this week-end. A lot of people hacked up some pretty cool projects, many of us coding until the very early morning Sunday (aka 4am), only to get back up a few hours later (aka 8am) to keep at it until the dreaded 15h45 deadline, when we all had to submit our demos. The organisers did a … Continue reading Boston Music Hackday
My internship at Sun Microsystems Labs, which has been going on for about 15 months – 9 of those full time at their campus in the Boston area – is coming to an end. During the course of those months, I’ve met a lot of very smart and fun people, I’ve worked on very challenging and stimulating problems and I’ve discovered a bunch of really … Continue reading My time at Sun Labs and pyaura