[racket] extracting "docstrings" from documentation

From: Danny Yoo (dyoo at cs.wpi.edu)
Date: Mon Dec 19 14:42:07 EST 2011

I'm trying to extract documentation strings for all the functions in
racket/base.  By documentation strings, I truly mean strings.  Here's
the progress I'm making on this:

    https://github.com/dyoo/extract-docstring

It's buggy still, and I'm working out the kinks.


The process I'm using to approach this is frankly a little insane, and
I would rather not go to the nuthouse for this.  I'm using setup/xref
and scribble/xref to figure out the source line and anchor of a
binding.  Next, I parse the HTML, grab at the element with the given
anchor name, and start sucking up HTML till I hit the next anchor.


I am web-scraping, and I know I should be ashamed of myself.  But I do
not see any other mechanisms available to me at the moment.  Have I
missed something obvious?


Posted on the users mailing list.