Vertical Web Extractor is a project to extract the data of products or something else in congeneric websites. Developed with java ,using SWT/JFace/RCP technology.Suitable for windows and linux.It's somehow like a vertical search engine plus data extract
PDFassassin is a module for Spamassassin which scans the content of PDF attachments against the spamassassin engine and appends the resulting spam score to the overall score of the email message.