BigSnarf blog

Infosec FTW

Redis, cPickle, iPython, linkedin scraper, parsing data


https://github.com/bigsnarfdude/webscraping/blob/master/linkedin_scaper.py

I have been building parsers and I didn’t even know it. I was reviewing the search terms that people use to find my blog. I’ve never really understood how to build a parser, but I did understand taking an input and doing something with it. I guess I will formally learn what parsing is this week.

http://sigusr2.net/2011/Apr/18/parser-combinators-made-simple.html
http://www.mollypages.org/page/grammar/index.mp

String Patterns

Finding and specifying classes of strings using regular expressions

Lexical Analysis

Breaking strings down into important words

Grammars

Specifying and deconstructing valid sentences

Parsing

Turning sentences into trees
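
To make those four steps concrete, here is a minimal sketch in Python for a tiny arithmetic language. It is not taken from the LinkedIn scraper; the names TOKEN_SPEC, tokenize and parse, and the grammar itself, are my own assumptions for illustration. Regular expressions describe the string patterns, tokenize does the lexical analysis, the grammar lives in a comment, and parse turns the tokens into a tree of nested tuples.

```python
import re

# String patterns: one regular expression per class of token.
# (TOKEN_SPEC and the token names are hypothetical, chosen for this sketch.)
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("OP", r"[+\-*/()]"),
    ("SKIP", r"\s+"),
]
TOKEN_RE = re.compile("|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC))


def tokenize(text):
    """Lexical analysis: break the string into (kind, value) tokens."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise SyntaxError(f"unexpected character {text[pos]!r} at position {pos}")
        if match.lastgroup != "SKIP":  # whitespace is matched but not kept
            tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens


# Grammar: which token sequences count as valid sentences.
#   expr   := term (("+" | "-") term)*
#   term   := factor (("*" | "/") factor)*
#   factor := NUMBER | "(" expr ")"

def parse(tokens):
    """Parsing: turn the token list into a tree of nested tuples."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else (None, None)

    def advance():
        nonlocal pos
        token = tokens[pos]
        pos += 1
        return token

    def expr():
        node = term()
        while peek() in (("OP", "+"), ("OP", "-")):
            _, op = advance()
            node = (op, node, term())
        return node

    def term():
        node = factor()
        while peek() in (("OP", "*"), ("OP", "/")):
            _, op = advance()
            node = (op, node, factor())
        return node

    def factor():
        kind, value = peek()
        if kind == "NUMBER":
            advance()
            return int(value)
        if (kind, value) == ("OP", "("):
            advance()
            node = expr()
            if peek() != ("OP", ")"):
                raise SyntaxError("expected ')'")
            advance()
            return node
        raise SyntaxError(f"unexpected token {value!r}")

    tree = expr()
    if pos != len(tokens):  # leftover tokens mean the sentence was not valid
        raise SyntaxError(f"unexpected token {tokens[pos][1]!r}")
    return tree
```

Calling parse(tokenize("1 + 2 * 3")) gives ("+", 1, ("*", 2, 3)), so the multiplication ends up deeper in the tree than the addition. That shape is what the grammar buys you: operator precedence falls out of which rule calls which.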

http://www.youtube.com/watch?v=6TmNX1ZON6k&list=ECBF6FC32358457242

[Image: Parser_Flow]

Other Python Parsing Tools

Check out these links for more information about parsing in Python:
