[plt-scheme] Parsing html : match or regexp (beginner question)

From: Bruce Butterfield (bab at entricom.com)
Date: Thu Nov 11 16:47:04 EST 2004

I recommend Neil Van Dyke's HtmlPrag which allows you to use the very 
cool ssax/sxpath tools to extract/rewrite/extend/play with whatever you 
can scrape up over the net. Very useful tools.

http://www.neilvandyke.org/htmlprag/

Thomas-Xavier MARTIN wrote:
>   For list-related administrative tasks:
>   http://list.cs.brown.edu/mailman/listinfo/plt-scheme
> 
> I'm trying to port some TCL web-scraping code to PLT-Scheme as a way to gain 
> more understanding of Scheme.
> 
> Or am I doing this all wrong ? Maybe I should read the HTML as an Xexp and use 
> the underlying structure instead of parsing a flat string. (Some of the 
> tidbits I parse for are the external links in the HTML page)
> 
> Any pointers towards enlightenment would be greatly appreciated!


Posted on the users mailing list.