Mormon Digitization Project, resurrected — Blog

I’m resurrecting the Mormon Digitization Project, which I blogged about nine months ago and then abandoned while I went and got married. (I feel justified.)

Project page: Mormon Digitization Project

Brief recap: the goal is to find pre-1923 Mormon books (out of copyright), scan them, OCR them, clean up the OCRed text, and release the plain text files on Project Gutenberg (along with ePub editions, possibly PDFs, and possibly Lulu editions as well).

I’m starting with John A. Widtsoe’s book Joseph Smith As Scientist and will go from there. If you have any suggestions/requests, leave them in the comments (or email them to me). If I get enough people helping out, we’ll be able to tackle a few books at a time.

Process-wise, I’m thinking about trying Bite-Size Edits for at least part of the cleanup. There’s also a remote possibility I’ll use PGDP, but I really, really don’t like their interface. Right now I’m planning to track things using email and a Google Spreadsheet.

Yes, this will be kind of similar to the Mormon Documentation Project, but they don’t seem to be doing the types of books we’ll be doing. (I did use their text for the Standard Works web app and for this D&C reader’s edition I’m still working on, though.)