Saturday, December 26, 2015

"Understanding Big Data"

Here is a review by Jean: http://www.goodreads.com/review/show/1477106074

Wednesday, December 23, 2015

"The practice of Network Security Monitoring"

Here is a review by Jean: http://www.goodreads.com/review/show/1477096529

Saturday, September 5, 2015

I feel relatively neutral about New York

Here is a review by Jean: http://www.goodreads.com/review/show/1383974143

Sunday, May 17, 2015

Finding old treasures with data science

Almost a century and a half ago, the SS Central America sank, taking down a fortune in gold. A team decided to tackle the challenge and to use data science to determine the position of the shipwreck.

The short movie: here.



Tuesday, May 5, 2015

"Google Hacking"

Here is a review by Jean: http://www.goodreads.com/review/show/1271816531

Friday, April 24, 2015

PoS Security Is POS - RSA Conference

Byrne and Henderson did a very good talk at RSA 2015 on PoS (Point-of-Sale) security. Basically, they said that most of the vendors have little to no idea on how to embed security correctly in their products. 

From passwords that have not been changed in almost ten years, to default remote administration protocols that are not protected via poor cryptography, Byrne and Henderson showed how these vendors put everybody at risk of being a victim of card fraud.

Here are the slides. Proceed with caution, this is highly disturbing.

Saturday, February 28, 2015

Randall Munroe's "What If"

Here is a review by Jean: http://www.goodreads.com/review/show/1213527093

Monday, February 2, 2015

Global warming, python and statistics part 3



Last year and in the previous two parts, we collected and analyzed the data using various tools such as sliding averages, linear regression and sine fitting. In this last part, we will predict what the climate may look like in the future (prediction). 

So far, we have established that the current climate follows a linear trend (long term trend) superposed to a seasonal cyclic variation. The highs and lows are distributed around these in an almost normal fashion. I want to insist that the weather is not a random variable: it follows the very strict rules of physics and weather specialists use complex models based on equations from thermodynamics and fluid mechanics. However, in the long run, the climate seems to behave like a random variable whose mean varies over time.

In order to not introduce any bias in the model, the observations until the end of 2014 will be used.

Let's try to predict the weather. If you recall, a normally distributed random variable will be between μ-2σ and μ+2σ about 90% of the time. Our model is then that the temperature will be, with 90% certainty, between the trend + seasonal cycle +/- 2σ. For the first month or so of 2015, this looks like this.

The solid red lines represent the two limits (90% confidence) for the high temperatures, the two blue lines the same for the low temperatures. The dotted lines represent the average. The dots are the observed temperatures from the GHCND data set. 

On average, the year has been rather on the low side: most of the measures for the high temperatures are near or below the average, with a few measures clearly below the low line. For the low temperatures, the situation is more balanced with about the same number of points above and below the average. It is to be noted that two values are clearly below the lower confidence bound. 

As of February the 1st, we have

TemperatureMaximumMinimum
High13.3C-6.0C
Low5.0C-13.2C

The maximum between the high and low temperatures for the same day is 15.4C, the minimum 1.6C.

Conclusion

Python, numpy, scipy and matplotlib are probably the best tools to start some data exploration: python is reasonably performant, numpy and scipy provide the necessary statistical and mathematical tools, and matplotlib is a good graphic library. 

The code for generating the various graphes can be found here.

Monday, January 26, 2015

"The Imitation Game"

It is estimated by historians that British mathematician Alan Turing helped, through his contributions at Bletchley Park, shorten World War II by at least two years. This movie his about that period of his life, though the beginning is set at the time of his arrest for gross indecency", the charge then in vigor in the UK for being homosexual.

The movie does not pretend to be an historical account nor a documentary on Turing's life, and as such viewers familiar with the "real" Turing will found some elements to be inaccurate. It is however a good story that proves I thoroughly enjoyed.

And the violent content was very limited, for a change.

Monday, January 5, 2015

"Flatland"

Here is a review by Jean: http://www.goodreads.com/review/show/447710813

Sunday, January 4, 2015

"Data Mining Tools for Malware Detection"

Here is a review by Jean: http://www.goodreads.com/review/show/1054850667