Fast HTML-to-text parser (article readability tool).
Given an HTML document, it pulls out the main body text and cleans it up.
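
The general idea can be illustrated with a minimal sketch. This is not the tool's actual algorithm or API: the names `MainTextExtractor` and `extract_main_text`, the tag lists, and the `min_words` threshold are all assumptions made for illustration. The sketch uses only Python's standard `html.parser` module, drops content inside tags that rarely hold article text, splits the rest into blocks, and keeps blocks with enough words.

```python
import re
from html.parser import HTMLParser

# Assumption: content inside these tags is never article text.
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside", "form", "noscript"}
# Assumption: these tags mark the boundary of a text block.
BLOCK_TAGS = {"p", "div", "article", "section", "li", "br", "td", "blockquote", "pre",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class MainTextExtractor(HTMLParser):
    """Hypothetical helper: collects whitespace-normalized text blocks."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.blocks = []       # finished text blocks, in document order
        self._buf = []         # text fragments of the block being built
        self._skip_depth = 0   # > 0 while inside a skipped tag

    def _flush(self):
        # Collapse runs of whitespace and store the block if non-empty.
        text = re.sub(r"\s+", " ", "".join(self._buf)).strip()
        if text:
            self.blocks.append(text)
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._flush()
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            if self._skip_depth > 0:
                self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._buf.append(data)

    def close(self):
        super().close()
        self._flush()


def extract_main_text(html, min_words=5):
    """Return blocks with at least `min_words` words, joined by blank lines.

    The word-count cutoff is a crude stand-in for a real content-scoring
    heuristic: very short blocks tend to be menus, labels, or captions.
    """
    parser = MainTextExtractor()
    parser.feed(html)
    parser.close()
    kept = [b for b in parser.blocks if len(b.split()) >= min_words]
    return "\n\n".join(kept)
```

Calling `extract_main_text(page_html)` on a page string returns the retained paragraphs as plain text. A real readability tool would likely score candidate containers by text density and link ratio rather than rely on a fixed word-count cutoff, so treat the threshold here as a placeholder.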
