[plt-scheme] Parsing html : match or regexp (beginner question)
I recommend Neil Van Dyke's HtmlPrag, which parses HTML (including
malformed pages) into SXML, so you can use the very cool ssax/sxpath
tools to extract, rewrite, extend, or otherwise play with whatever you
can scrape off the net.
http://www.neilvandyke.org/htmlprag/
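
Here is a minimal sketch of the idea, walking the tree instead of
matching regexps against a flat string. It assumes the SXML shape that
HtmlPrag produces: a *TOP* node, with attributes held in an (@ ...)
list. The sample page is written by hand so that the code runs without
any library installed; in real use the tree would come from HtmlPrag's
html->shtml. The helper names (starts-with?, element-href,
collect-hrefs, external-links) are made up for this example, and
"external" is simplified to mean "the href starts with http:// or
https://".

```scheme
;; Hand-written sample in the SXML shape HtmlPrag is assumed to return.
(define page
  '(*TOP*
    (html
     (body
      (p "See "
         (a (@ (href "http://www.neilvandyke.org/htmlprag/")) "HtmlPrag")
         " and the "
         (a (@ (href "/local/page.html")) "local page")
         ".")
      (a (@ (href "http://list.cs.brown.edu/mailman/listinfo/plt-scheme"))
         "list info")))))

;; #t if string s begins with prefix.
(define (starts-with? s prefix)
  (let ((n (string-length prefix)))
    (and (>= (string-length s) n)
         (string=? (substring s 0 n) prefix))))

;; The href value of an element, or #f if it has none.
(define (element-href elem)
  (let ((attrs (and (pair? (cdr elem))
                    (pair? (cadr elem))
                    (eq? (car (cadr elem)) '@)
                    (cdr (cadr elem)))))
    (cond ((and attrs (assq 'href attrs))
           => (lambda (entry) (and (pair? (cdr entry)) (cadr entry))))
          (else #f))))

;; Every href found on an (a ...) element, in document order.
(define (collect-hrefs node)
  (cond ((not (pair? node)) '())        ; strings and other atoms
        ((eq? (car node) '@) '())       ; attribute lists hold no elements
        (else
         (let ((here (and (eq? (car node) 'a) (element-href node)))
               (below (apply append (map collect-hrefs (cdr node)))))
           (if here (cons here below) below)))))

;; Keep only the hrefs that look external (simplified test, see above).
(define (external-links node)
  (let loop ((hrefs (collect-hrefs node)) (acc '()))
    (cond ((null? hrefs) (reverse acc))
          ((or (starts-with? (car hrefs) "http://")
               (starts-with? (car hrefs) "https://"))
           (loop (cdr hrefs) (cons (car hrefs) acc)))
          (else (loop (cdr hrefs) acc)))))

(external-links page)
;; => ("http://www.neilvandyke.org/htmlprag/"
;;     "http://list.cs.brown.edu/mailman/listinfo/plt-scheme")
```

Once sxpath is loaded, a path query can probably replace the
hand-written collect-hrefs, but the walk above shows what the
structure buys you over a regexp.
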
Thomas-Xavier MARTIN wrote:
> I'm trying to port some Tcl web-scraping code to PLT Scheme as a way to gain
> more understanding of Scheme.
>
> Or am I doing this all wrong? Maybe I should read the HTML as an Xexp and use
> the underlying structure instead of parsing a flat string. (Some of the
> tidbits I parse for are the external links in the HTML page.)
>
> Any pointers towards enlightenment would be greatly appreciated!