BigSnarf blog

Infosec FTW

Category Archives: Thoughts

Streaming Prototype

Apache Spark Use Casesstreaming-arch

Our specific use case

Screen Shot 2015-05-23 at 11.18.22 AM

Kinesis gets raw logs

Screen Shot 2015-05-21 at 5.14.41 PM

Spark Streaming does the counting

Screen Shot 2015-05-21 at 5.14.04 PM

Two Tables Created, One for Kinesis Log Position and the Second for Aggregates

Screen Shot 2015-05-23 at 10.40.15 AM

DynamoDB stores the aggregations

Screen Shot 2015-05-21 at 5.12.11 PM

Amazon introduces ML service

Face Detection Rasperry Pi 2 Day

DataFrames meet Apache Spark 1.3

Spark Scala Notebook incubating Apache video

Norvig on Machine Learning

Happy Pancake Stack

Brute Force D3.js Visualization

Running Apache Spark EMR and EC2 scripts on AWS with read write S3

Video Demo of Spark on EMR

Other posts I did learning EMR

https://bigsnarf.wordpress.com/2014/10/22/process-logs-with-kinesis-s3-apache-spark-on-emr-amazon-rds/

https://bigsnarf.wordpress.com/2015/01/05/apache-spark-1-0-0-emr-via-command-line/

Script to launch you own cluster on EC2

Spark Cluster Build Output for EC2

Commands to experiment with Spark Shell and read write to S3

Output for Simple Word Count job on EMR

Screen Shot 2015-01-20 at 4.42.11 PM

Links to Apache Spark and Collection of Spark EMR Posts

Rsyslog Remotes sending to ElasticSearch and Kibana

Follow

Get every new post delivered to your Inbox.

Join 50 other followers