[TriEmbed] Pointers for attempting very basic Web Scraping?

Shane Trent shanedtrent at gmail.com
Mon Mar 10 10:31:46 CDT 2014


I am looking for pointers at attempting what I hope will be a very simple
web scraping project. Our elementary school has a newsletter that has every
article under a separate link, requiring 15 clicks to read the whole
newsletter. Not a great UI experience in my option. Here is an example
newsletter.

http://www.tmsacademy.org/index.php?option=com_content&view=category&id=120&Itemid=553

I would like to find a way to get all of the newsletter content on a single
page (and learn a few "teach a man to fish" skills). Pulling into a local
document would be acceptable but I would like to be able to share the
single page view with other parents at the school. I am not sure of the
best way to do this either!

A casual web-search points to Python and a few extensions but most
references I found target data harvesting. I wonder if there is a simpler
approach.

I suspect Carl can point me in the right direction but wanted to shout-out
to the list on the chance that someone has already done something similar.

Thanks,
Shane
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.triembed.org/pipermail/triembed_triembed.org/attachments/20140310/bb496618/attachment.htm>


More information about the TriEmbed mailing list