I haven’t posted to PROFUNC for over a month, this is mostly my fault for not being able to keep up with such an ambitious endevour. I did receive the PROFUNC files from the Library and Archives Canada, and they are available for download, however I haven’t written up summaries of all the documents because I simply haven’t had the time to do so. What I want to do with these is a bit different than what I did in the past. The fact is that we need to be able to do a rich text search of the documents, since there are over 6000 pages, and the current open source OCR software is absolutely awful, especially when it comes to completely illegible cursive written by a Mountie.
I personally feel that this is a very important part of History, and while I’ve had an interest and a talent for history, I prefer Software Development, and hacking on stuff, so this has been sitting here waiting for someone to do the boring work of transcribing it. I’m tempted to try an experiment where people who want to transcribe this into markdown can do so on GitHub, then we go and store the documents in a database and run sphinx on it, or we do some semantic web stuff on it or something to make the documents more useful than the giant wall of text they are now.
If you are interested, please feel free to e-mail email@example.com for more info on how to contribute.