CINXE.COM
Apache UIMA - Apache UIMA
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "https://www.w3.org/TR/html4/loose.dtd"> <!-- ====================================================================== --> <!-- GENERATED FILE, DO NOT EDIT, EDIT THE XML FILE IN xdocs INSTEAD! --> <!-- ====================================================================== --> <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/> <style type="text/css">@import "stylesheets/base.css";</style> <meta name="author" value=" Apache UIMA Documentation Team "> <meta name="email" value="dev@uima.apache.org"> <title>Apache UIMA - Apache UIMA</title> </head> <body> <div class="topLogos"> <table border="0" width="100%" cellspacing="0"> <!-- TOP IMAGE --> <tr> <td align='LEFT'> <a href="index.html"> <img style="border: 1px solid black;" src="./images/UIMA_banner2tlpTm.png" alt="UIMA project logo" border="0"/> </a> </td> <td align='CENTER'> <div class="pageBanner">Apache UIMA</div> </td> <td align='RIGHT'> <a href="https://www.apache.org"> <img src="./images/asf-logo-on-white-smallTm.png" alt="Apache UIMA" border="0"/> </a> </td> </tr> </table> <hr noshade="" size="1"/> </div> <table border="0" width="100%" cellspacing="4"> <tr> <td align='RIGHT' colspan="2"> <form method="get" action="https://www.google.com/search"> Search the site <input type="text" name="q" size="25" maxlength="255" value="" /> <input type="hidden" name="sitesearch" value="https://uima.apache.org/" /> <input name="Search" value="Search Site" type="submit"/> </form> </td> </tr> <tr> <!-- LEFT SIDE NAVIGATION --> <td width="20%" valign="top"> <!-- regular menu --> <div class="navBar"> <br/> <div class="navBarItem"> <div class="navPartHeading">General</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="./index.html">Home</a> </div> <div class="navBarItem"> <a href="./downloads.cgi">Downloads</a> </div> <div class="navBarItem"> <a href="./documentation.html">Documentation</a> </div> <div class="navBarItem"> <a href="./news.html">News</a> </div> <div class="navBarItem"> <a href="./publications.html">Publications</a> </div> <br style="line-height: .5em"/> <div class="navBarItem"> <a href="https://issues.apache.org/jira/browse/uima" target="_blank" rel="noopener">Issue tracker <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="https://cwiki.apache.org/confluence/display/UIMA/" target="_blank" rel="noopener">Wiki <img src="images/offsitelink.png"/></a> </div> <br style="line-height: .5em"/> <div class="navBarItem"> <a href="https://cwiki.apache.org/confluence/display/UIMA/Powered+by+Apache+UIMA" target="_blank" rel="noopener">Powered By UIMA <img src="images/offsitelink.png"/></a> </div> </div> <br/> <div class="navBarItem"> <div class="navPartHeading">Community</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="./get-involved.html">Get Involved</a> </div> <div class="navBarItem"> <a href="./mail-lists.html">Mailing Lists</a> </div> <div class="navBarItem"> <a href="./contribution-policy.html">Contribution Policies</a> </div> <div class="navBarItem"> <a href="./faq.html">FAQ</a> </div> <div class="navBarItem"> <a href="./project-guidelines.html">Project Guidelines</a> </div> </div> <br/> <div class="navBarItem"> <div class="navPartHeading">Components & Tools</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="./sandbox.html#uima-addons-annotators">Annotators</a> </div> <div class="navBarItem"> <a href="./toolsServers.html">Tools & Servers</a> </div> <div class="navBarItem"> <a href="./sandbox.html">Addons and Sandbox</a> </div> <div class="navBarItem"> <a href="./ruta.html">UIMA Ruta</a> </div> <div class="navBarItem"> <a href="https://github.com/apache/uima-uimafit" target="_blank" rel="noopener">uimaFIT <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="./external-resources.html">External Resources</a> </div> </div> <br/> <div class="navBarItem"> <div class="navPartHeading">Development</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="./dev-quick.html">Quick Start: building</a> </div> <div class="navBarItem"> <a href="./building-uima.html">Building from Source</a> </div> <div class="navBarItem"> <a href="./one-time-setup.html">One-time setups</a> </div> <div class="navBarItem"> <a href="./svn.html">Source Code</a> </div> <div class="navBarItem"> <a href="./release.html">Doing a UIMA release</a> </div> <div class="navBarItem"> <a href="https://www.apache.org/security/committers.html" target="_blank" rel="noopener">Doing a CVE (Apache) <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="./eclipse-update-site.html">Eclipse Update Sites</a> </div> <div class="navBarItem"> <a href="./git.html">GIT</a> </div> <div class="navBarItem"> <a href="./codeConventions.html">Code Conventions</a> </div> <div class="navBarItem"> <a href="./uima-specification.html">UIMA Specification (OASIS)</a> </div> <div class="navBarItem"> <a href="./team-list.html">Project Team</a> </div> <div class="navBarItem"> <a href="./maven-design.html">Maven Use</a> </div> <div class="navBarItem"> <a href="./updating-website.html">Updating this Website</a> </div> </div> <br/> <div class="navBarItem"> <div class="navPartHeading">Events and Conferences</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="./coling14.html">COLING 2014</a> </div> <div class="navBarItem"> <a href="./gscl13.html">GSCL 2013</a> </div> <div class="navBarItem"> <a href="./iks09.html">IKS 2009</a> </div> <div class="navBarItem"> <a href="./gscl09.html">GSCL 2009</a> </div> <div class="navBarItem"> <a href="./lsm09.html">LSM 2009</a> </div> <div class="navBarItem"> <a href="./lrec08.html">LREC 2008</a> </div> <div class="navBarItem"> <a href="./gldv07.html">GLDV 2007</a> </div> </div> <br/> <div class="navBarItem"> <div class="navPartHeading">ASF</div> </div> <div class="navBar"> <div class="navBarItem"> <a href="https://www.apache.org/licenses/" target="_blank" rel="noopener">License <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="https://www.apache.org/foundation/thanks.html" target="_blank" rel="noopener">ASF Sponsors <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="https://www.apache.org/foundation/sponsorship.html" target="_blank" rel="noopener">ASF Sponsorship <img src="images/offsitelink.png"/></a> </div> <div class="navBarItem"> <a href="./security_report">Security</a> </div> </div> </div> </td> <td width="80%" align="left" valign="top"> <div class="sectionTable"> <table class="sectionTable"> <tr><td> <a name="Welcome to the Apache UIMA project"><h1><img src="images/UIMA_4sq50tightCropSolid.png"/> Welcome to the Apache UIMA project</h1></a> </td></tr> <tr><td> <blockquote class="sectionBody"> <div style="float: right; width: 220px; margin-left: 10px; border: 1px solid grey; background: white;"> <div style="text-align: center; border-bottom: 1px solid grey; margin:0.2em; padding-bottom: 0.2em;"> <a href="https://www.apache.org/events/current-event.html"> <img style="width: 210px" src="images/coc-logo-color.svg" /> </a> </div> <div style="height:105px; text-align: center; margin:0.2em; padding-bottom: 0.2em;"><a href="https://donate.apache.org"><img style="display: none" src="./images/supportapache-whspc-main_1490600386.png" title="Donate to ASF" /> <span style="display: block; padding-bottom: 50%; height: 0; background-position: center; background-size: cover; background-repeat: no-repeat; background-image: url(./images/supportapache-whspc-main_1490600386.png);" /></a></div> </div> <p> Welcome to the Apache UIMA™ project. Our goal is to support a thriving community of users and developers of UIMA frameworks, tools, and annotators, facilitating the analysis of unstructured content such as text, audio and video. </p> <h2>What is UIMA?</h2> <p> Unstructured Information Management applications are software systems that analyze large volumes of unstructured information in order to discover knowledge that is relevant to an end user. An example UIM application might ingest plain text and identify entities, such as persons, places, organizations; or relations, such as works-for or located-at. </p> <img style="float: right;" src="./images/UimaIs.png" width="430" alt="UIMA is made of many things" /> <p>UIMA enables applications to be decomposed into components, for example "language identification" => "language specific segmentation" => "sentence boundary detection" => "entity detection (person/place names etc.)". Each component implements interfaces defined by the framework and provides self-describing metadata via XML descriptor files. The framework manages these components and the data flow between them. Components are written in Java or C++; the data that flows between components is designed for efficient mapping between these languages. </p> <p>UIMA additionally provides capabilities to wrap components as network services, and can scale to very large volumes by replicating processing pipelines over a cluster of networked nodes. </p> <p> Apache UIMA is an Apache-licensed open source implementation of the UIMA specification <a href="https://www.oasis-open.org/committees/download.php/28492/uima-spec-wd-05.pdf">[pdf]</a> <a href="https://www.oasis-open.org/committees/download.php/28491/uima-spec-wd-05.doc">[doc]</a> (that specification is, in turn, being developed concurrently by a technical committee within <a href="https://www.oasis-open.org/committees/tc_home.php?wg_abbrev=uima">OASIS</a> , a standards organization). We invite and encourage you to participate in both the implementation and specification efforts. </p> <p class="nobottomspace">Here you can find:</p> <ul class="noverticalspace"> <li>Frameworks</li> <li>Components, and</li> <li>Infrastructure,</li> </ul> <p class="notopspace">all licensed under the Apache license. The dashed-line boxes above are placeholders for possible future additions. </p> <p> The <a href="doc-uima-why.html">Frameworks</a> run the components, and are available for both Java and C++. The <a href="doc-uima-examples.html">Java Framework</a> supports running both Java and non-Java components (using the C++ framework). The <a href="doc-uimacpp-huh.html">C++ framework</a>, besides supporting annotators written in C/C++, also supports Perl, Python, and TCL annotators. </p> <p>The frameworks support configuring and running pipelines of Annotator components. These components do the actual work of analyzing the unstructured information. Users can <a href="doc-uima-annotator.html">write their own annotators</a>, or configure and use pre-existing annotators. Some annotators are <a href="sandbox.html#uima-addons-annotators">available as part of this project</a>; others are contained in <a href="external-resources.html">various repositories</a> on the internet. GitHub.com lists over <a target="_blank" rel="nofollow noopener" href="https://github.com/apache/uima-uimaj/network/dependents?package_id=UGFja2FnZS0xNzk4MzkxNTI%3D">900 repositories</a> that have dependencies on the UIMA Java SDK core. </p> <p>In 2018, the internals of the UIMA Java framework were redesigned for a tighter integration with Java including for example, <a target="_blank" rel="nofollow noopener" href="https://uima.apache.org/d/uimaj-current/version_3_users_guide.html#uv3.select.terminal_form_actions.streams">Java Streams</a> support for working with annotations, and for supporting higher performance on modern processors with memory caches. This is available as Version 3.x.x of the UIMA Java SDK, and associated components. Version 2 continues to be supported, but new work should use version 3. </p> <p>An extensive rule-based scripting language (<a href="ruta.html">RUTA</a>), an associated analysis engine built on top of UIMA, and an Eclipse-based tooling workbench for interactively developing and testing the rules, is part of this project. </p> <p> Additional infrastructure support components include a <a href="sandbox.html#simple-server">simple server</a> that can receive REST requests and return annotation results, for use by other web services. </p> <p>The <a href="sandbox.html">Addons and Sandbox</a> is for Addons (Annotators and other things) for UIMA, and a place where new ideas are developed for potential incorporation into the project. </p> </blockquote> </p> </td></tr> </table> <div class="sectionTable"> <table class="sectionTable"> <tr><td> <a name="UIMA News"><h1><img src="images/UIMA_4sq50tightCropSolid.png"/> UIMA News</h1></a> </td></tr> <tr><td> <blockquote class="sectionBody"> <p>Click <a href="news.html">News</a> for a listing of releases and other news.</p> </blockquote> </p> </td></tr> </table> </td> </tr> <!-- FOOTER --> <tr><td colspan="2"> <hr noshade="" size="1"/> </td></tr> <tr><td colspan="2"> <table class="pageFooter"> <tr> <td><a href="index.html">Home</a></td> <td><a href="privacy-policy.html">Privacy Policy</a></td> <td style="font-size:75%"> Copyright © 2006-2024, The Apache Software Foundation.<br/> Apache UIMA, UIMA, the Apache UIMA logo and the Apache Feather logo are trademarks of The Apache Software Foundation.<br/> All other marks mentioned may be trademarks or registered trademarks of their respective owners. </td> <td><a href="mailto:dev@uima.apache.org">Contact us</a></td> </tr> </table> </td></tr> </table> </body> </html>