Tag Archives: Document Processing Pipeline

OpenPipeline – an open-source document processing pipeline

Most commercial search engines include a more or less advanced document processing pipeline for transforming raw input into something that can be indexed. The process involves normalization, entity extraction, linguistic processing, annotation, data cleansing etc. When it comes to Open … Continue reading

Posted in Open Source, Search technology, Technology | Tagged , , | Leave a comment