I'm the founder of Snowtide Informatics. We make DocuHarvest, a web application that turns your valuable documents into data, and PDFTextStream, a PDF text extraction library for Java and .NET. I do a lot of programming in Clojure and just a little in Java, trying to make it easier for people to make unstructured content just a little more useful.
I host an occasional podcast you might want to check out:
Strictly Professional » Podcasts (Podcasting about programming, the business of software, and otherwise making fools of ourselves.)