Extracting Structured Data from Free Text

The EDW team at Northwestern has developed a custom SQL Server Integration Services (SSIS) component that extracts structured data from free text using configurable regular expressions. This extensible tool has already been used to build many data marts that turn free-text documentation into discrete information.
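To make the idea concrete, here is a minimal Python sketch of regex-driven extraction. It is not the RegExtractor component itself, which runs inside SSIS, and it does not reflect its configuration format. The rule names, patterns, sample note, and the `extract_fields` helper are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical extraction rules: output column name -> regular expression.
# Each pattern has exactly one capturing group holding the value to extract.
RULES = {
    "ejection_fraction": r"ejection fraction\D{0,20}(\d{1,2})\s*%",
    "systolic_bp": r"\bBP\D{0,10}(\d{2,3})\s*/\s*\d{2,3}",
    "diastolic_bp": r"\bBP\D{0,10}\d{2,3}\s*/\s*(\d{2,3})",
}


def extract_fields(text, rules=RULES):
    """Apply every rule to one free-text note and return a row of discrete values.

    A column is set to None when its pattern does not match.
    """
    row = {}
    for column, pattern in rules.items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        row[column] = match.group(1) if match else None
    return row


# Made-up sample note, not real patient data.
note = "Echo today. Ejection fraction is 45%. BP: 132/84, patient resting."
row = extract_fields(note)
print(row)
```

Because the rules live in a plain dictionary, adding a new extracted column means adding one entry rather than changing the extraction logic, which is the general appeal of keeping the expressions configurable.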

This SSIS component has been open-sourced and is available on the CodePlex page for RegExtractor.

