A couple of features have also been added.
- Ability to scrape Paul Graham's essays. His pages don't follow web standards (for example, they use <br> for a new paragraph instead of wrapping it in <p> tags), so Hpricot can't extract the content easily. I had to implement a special hack just for Paul.
- DB indexes and HTTP caching. At the alpha stage I don't know how much they will matter, but I am preparing for happy problems.
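To give a rough idea of what such a hack can look like, here is a minimal sketch that splits markup into paragraphs wherever two or more line-break tags appear in a row. It is not the actual code used here: it relies only on Ruby's standard regular expressions instead of Hpricot, and the method name `paragraphs_from_br_html` is made up for illustration.

```ruby
# Hypothetical helper (not the real implementation): turn HTML that separates
# paragraphs with runs of <br> tags into an array of plain-text paragraphs.
def paragraphs_from_br_html(html)
  html
    .split(/(?:<br\s*\/?>\s*){2,}/i)                 # 2+ consecutive <br> tags end a paragraph
    .map { |chunk| chunk.gsub(/<br\s*\/?>/i, " ") }  # a lone <br> is just a line break
    .map { |chunk| chunk.gsub(/<[^>]+>/, "") }       # drop any remaining inline tags
    .map { |chunk| chunk.gsub(/\s+/, " ").strip }    # normalize whitespace
    .reject(&:empty?)
end

sample = "First paragraph.<br><br>Second <i>paragraph</i><br />\n<br />Third."
paragraphs = paragraphs_from_br_html(sample)
```

A regex pass like this is fragile on arbitrary HTML, but for pages with a simple, consistent layout it may be enough to get clean paragraphs out.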