The Programming Historian 2 is producing some very useful tutorials including some on Cleaning OCR’d Text with Regular Expressions. This was started by William J Turkel and others and is now supported by the Center for History and New Media. The tutorials are released under a Creative Commons so they can be copied and adapted.