[TriEmbed] Pointers for attempting very basic Web Scraping?

Adam S. Crane ascrane at gmail.com
Mon Mar 10 10:59:32 CDT 2014


Python + BeautifulSoup (module) + Requests (module) = go fish!
On Mar 10, 2014 11:31 AM, "Shane Trent" <shanedtrent at gmail.com> wrote:

> I am looking for pointers at attempting what I hope will be a very simple
> web scraping project. Our elementary school has a newsletter that has every
> article under a separate link, requiring 15 clicks to read the whole
> newsletter. Not a great UI experience in my option. Here is an example
> newsletter.
>
>
> http://www.tmsacademy.org/index.php?option=com_content&view=category&id=120&Itemid=553
>
> I would like to find a way to get all of the newsletter content on a
> single page (and learn a few "teach a man to fish" skills). Pulling into a
> local document would be acceptable but I would like to be able to share the
> single page view with other parents at the school. I am not sure of the
> best way to do this either!
>
> A casual web-search points to Python and a few extensions but most
> references I found target data harvesting. I wonder if there is a simpler
> approach.
>
> I suspect Carl can point me in the right direction but wanted to shout-out
> to the list on the chance that someone has already done something similar.
>
> Thanks,
> Shane
>
>
>
> _______________________________________________
> Triangle, NC Embedded Computing mailing list
> TriEmbed at triembed.org
> http://mail.triembed.org/mailman/listinfo/triembed_triembed.org
> TriEmbed web site: http://TriEmbed.org
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.triembed.org/pipermail/triembed_triembed.org/attachments/20140310/4ea5ee51/attachment.htm>


More information about the TriEmbed mailing list