BigSnarf blog

Infosec FTW

Get Moar Data or Tough Luck


Lambda Architecture – Redis/Postgres



Speed Layer (Blue):

Query (Green Output):

Batch Layer (Purple):

  • S3 or Postgres
  • Luigi/Cro
  • Prediction Models Trained
  • Postgres

TensorFlow and Kaggle

Screen Shot 2016-04-17 at 12.02.46 AM

Screen Shot 2016-04-17 at 12.04.35 AM

TensorFlow explained

Anomaly detection with Bayesian networks

Anomaly detection, also known as outlier detection, is the process of identifying data which is unusual. I have been using basic python Markov Chains or more complex python MCMC.

Anomaly detection can also be used to detect unusual time series. Bayesian networks are well suited for anomaly detection, because they can handle high dimensional data, which humans find difficult to interpret.

One typical way we can use data visualizations to identify some anomalies and these are clearly visible by plotting individual variables. More often anomalies are far more subtle, and are based on the interaction of many variables.


Screen Shot 2016-04-10 at 9.12.48 AMScreen Shot 2016-04-10 at 9.07.25 AM

Here is a nice notebook on python mcmc:

I haven’t read the previous blog post on FFT. There are lots of time series analysis.

An interesting method for detection of patterns is using “Shape Search”:

Screen Shot 2016-04-10 at 8.58.20 AM


But I think there are interesting things using signal processing as well for AD like Median Filter.

Real Time Analytics with PostgreSQL

Screen Shot 2016-04-02 at 10.38.53 PMScreen Shot 2016-03-29 at 8.18.46 AM

Screen Shot 2016-04-02 at 8.48.39 AM


RF, SVM, KNN ensembles training

Screen Shot 2016-03-26 at 12.24.10 AM

Spark OLAP

STL for anomaly detection

Monitor Cloudtrail logs with this Python Slackbot – Who is using AWS Console?


Get every new post delivered to your Inbox.

Join 53 other followers