[plt-scheme] Scheme productivity data point

From: Jon Rafkind (rafkind at cs.utah.edu)
Date: Sun Nov 8 12:38:55 EST 2009

Neil Van Dyke wrote:
> FYI, a small Scheme productivity data point that I just blogged...
>> Scheme Is Likely Fast Enough for You
>> I needed to process a corpus of HTML files, which involved a bunch of 
>> parsing, XPath-ing, simple pattern-matching, a little regexp-ing, and 
>> HTML generation.
>> Using PLT Scheme, Oleg Kiselyov's and Kirill Lisovsky's SXPath, Jim 
>> Bender's sxml-match, HtmlPrag, and WebScraperHelper, I banged it 
>> together in a day for a sample of the data, then spent another day 
>> tweaking it to run through the entire corpus. Most of the second day 
>> was due to some strangely bad HTML buried in some of the 40,000+ 
>> files, and having to juggle remote computers.
>> Afterwards, half tongue-in-cheek, I played fanboy for the benefit of 
>> a colleague who still wants to do symbolic processing in C++, 
>> laboriously...
>>> [...] The output files take less than 7% the space of the input files.

The input files were HTML and the output files were something else, but
does that size reduction have anything to do with the fact that you used
Scheme? I don't see how the engine is a factor here, unless I missed
something.

Posted on the users mailing list.