Je viens de publier un article, MapReduce avec parallel, cat et une redirection, sur le blogue de Datacratic.
I’ve recently learned a couple of neat tricks to process large amounts of text files more efficiently from my new co-worker @nicolaskruchten. Our use-case is efficiently going through tens of gigabytes of logs to extract specific lines and do some operation on them. Here are a couple of things we’ve done to speed things up. Keep everything gziped Often, the bottleneck will be IO. This … Continue reading Efficient log processing
OSX’s top program doesn’t quite behave like its Linux counterpart out of the box. For me, the two biggest problems are that processes aren’t sorted by CPU usage and the top program itself uses 10% of the CPU because it calculates all sorts of statistics about memory and shared library usage that I personally don’t care about. There are a series of flags that you … Continue reading Make OSX’s top behave like Linux’s top
My old D-Link router, like pretty much every other router I’ve ever owned, wasn’t very reliable in some way and so I was looking for open-source alternative firmwares like Tomato to flash it with. With the clear lack of effort put into the official firmwares, I thought it couldn’t hurt to try. Unfortunately, my router wasn’t supported by any third party firmware. During my search, … Continue reading pfSense : a software alternative to your old router/firewall
A good friend of mine, and fellow trekkie, showed me something very interesting in the unicode man page. (Type man unicode on a unix system, or you can get it here) UCS contains the characters required to represent practically all known languages. This includes not only the Latin, Greek, Cyrillic, Hebrew, Arabic, Armenian, and Georgian scripts, but also also Chinese, Japanese and Korean Han ideographs … Continue reading Interesting addition to unicode