OKKAM Community Portal

 

OKKAM4MS for Microsoft Word


Environment

Microsoft Word® is a word processor included in the Microsoft Office suite. Its first release dates back to the early 1980s; the most recent one, at the time of writing, was in 2007. Microsoft Word is a powerful program that lets the user create and share professional documents with a set of writing tools and utilities such as a contextual spelling checker, a thesaurus and a live word count.

Okkam4MSW is an MS Word plug-in for the globally unique identification of entities inside text documents created with MS Word. MS Word was chosen as the host application mainly because it is so widely used. NLP and semantic technologies detect entities in the text and extract the contextual information needed to match them against the OKKAM entity repository.

The plug-in is accessed through a toolbar integrated into MS Word.

 

 

What it does

This section describes the steps of the OKKAMization process, in particular how OKKAM moves and processes data, to clarify how the OKKAM Empowered tool works.
The following picture describes the three main steps of this workflow:

 


 

  1. The user starts writing an email
  2. The GMAIL OKKAM empowered tool takes the email’s body and sends it to the OKKAM System.


    ATTENTION: In order to perform the annotation process, the content of your email is sent to the OKKAM annotation pipeline! However, your email IS NOT STORED by OKKAM.
     

    Recognizing the named entities in the email requires analyzing it with the OKKAM technology. This technology is complex and based on a semantic approach, so it cannot run on the client side and a server-side approach is needed. Before the process starts, you are asked to view and accept the OKKAM privacy statement. If you decline, the process is aborted and no data is sent to OKKAM.

  3. OKKAM starts to analyze the plain text coming from the email. The first step is entity extraction. Each tool analyzes a different kind of information, so the gathering of data (ACQUISITION) can be carried out in various ways depending on the situation. After acquisition, the first processing step is parsing and analyzing the text to extract entities. The system combines three different approaches:
    1. Keyword based 

    2. Shallow linguistics

    3. Semantic Analysis
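
As a rough illustration of the simplest of these approaches, keyword-based extraction, the sketch below looks up a fixed list of names in a text. It is not the OKKAM implementation: the gazetteer, the entity types and the function name `extract_keyword_entities` are all assumptions made for the example.

```python
import re

# Hypothetical gazetteer (surface form -> entity type); not OKKAM data.
GAZETTEER = {
    "Trento": "location",
    "Expert System": "organization",
}

def extract_keyword_entities(text, gazetteer=GAZETTEER):
    """Return (surface form, type, start offset) for every gazetteer hit."""
    hits = []
    for name, kind in gazetteer.items():
        # Word boundaries avoid matching inside longer words.
        pattern = r"\b" + re.escape(name) + r"\b"
        for match in re.finditer(pattern, text):
            hits.append((name, kind, match.start()))
    # Order the hits by their position in the text.
    return sorted(hits, key=lambda hit: hit[2])
```

A keyword list like this is fast but cannot resolve ambiguous names, which is presumably why it is combined with the linguistic and semantic approaches.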

The main module is the semantic analysis, based on the semantic capabilities of COGITO® Discover, which supports texts in English and Italian. A query is created for each identified named entity (e.g. person, location). The query contains the main features that help identify the candidate entity uniquely, and it is sent to the OKKAM Engine. The Entity Matching stage then generates an OKKAM ID for the entity where one is required. This process consists of the following steps:
  1. Query processing
    The OKKAM Engine analyzes the query, searches the repository and returns, if present, the correct OKKAM ID for the named entity.

  2. Identification of the correct OKKAM ID
    The OKKAM ID resulting from the match is a value ready to be included as a new field in the document.
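
The query-and-match steps can be sketched as follows. This is only a minimal model under stated assumptions: the `EntityQuery` class, the in-memory `REPOSITORY`, the scoring rule and the placeholder identifiers are hypothetical and do not reflect the real OKKAM Engine or its ID format.

```python
from dataclasses import dataclass, field

@dataclass
class EntityQuery:
    """Hypothetical query: a name, a type and optional context features."""
    name: str
    kind: str
    context: dict = field(default_factory=dict)

# Hypothetical stand-in for the entity repository; the ids are placeholders.
REPOSITORY = [
    {"id": "placeholder-id-1", "name": "Paris", "kind": "location",
     "country": "France"},
    {"id": "placeholder-id-2", "name": "Paris", "kind": "location",
     "country": "United States"},
]

def match_entity(query, repository=REPOSITORY):
    """Return the id of the best-matching record, or None if none matches."""
    best_id, best_score = None, 0
    for record in repository:
        if record["name"] != query.name or record["kind"] != query.kind:
            continue
        # One point for the name/type match, plus one per agreeing feature.
        score = 1 + sum(
            1 for key, value in query.context.items()
            if record.get(key) == value
        )
        if score > best_score:
            best_id, best_score = record["id"], score
    return best_id
```

The context features are what separate two entities that share a name: with `{"country": "France"}` the first record scores higher, while a name absent from the repository yields `None`.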


This module is the basis for building a Web of Entities, in which information about the same entity is consolidated so that aggregation, integration and mash-up become easier and faster.
As a result, your email is tagged with the OKKAM IDs.
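
To make the idea of tagging concrete, here is a small sketch that writes an identifier next to each recognized entity. The bracketed output format, the function name `tag_text` and the placeholder identifier are assumptions for illustration; the plug-in's actual tagging format is not described here.

```python
def tag_text(text, matches):
    """Append a bracketed identifier after each matched entity.

    `matches` is a list of (start, end, identifier) tuples describing
    non-overlapping character spans in `text`.
    """
    tagged = text
    # Work from the end so earlier offsets stay valid after each insertion.
    for start, end, identifier in sorted(matches, reverse=True):
        tagged = tagged[:end] + f" [{identifier}]" + tagged[end:]
    return tagged
```

For example, `tag_text("Paris is nice.", [(0, 5, "placeholder-id-1")])` produces `Paris [placeholder-id-1] is nice.`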

 

Features

This tool lets you find and highlight entities inside your text in several ways.
It provides the following functions:

 

  • Okkamize text
    Selecting this adds a short report to your email showing the Okkam identifier of each entity in your text.

  • Interactive document OKKAMization
    Users can analyze part of a text by selecting it first

  • Batch document OKKAMization
    Users can analyze a text in the background

  • Interaction with OKKAM
    Users can select the correct OKKAM entity

  • Automatically create an index inside the document

  • Export all entities in other formats

  • Remove the Okkamization
  • Search
    Links to the Okkam web page where you can search for entities.

  • Add entity
    Links to the Okkam web page where you can add an entity. Registration is required. http://api.okkam.org/EnsWebToolKit

  • Homepage
    Links to the OKKAM homepage and to the pages where you can search for and add entities.

  • Help
    If you need additional help, you will find the user's manual here.

  • About
    Detects available upgrades automatically and links to the OKKAM feedback page.

Technology

Natural language processing and semantic technologies are used for entity detection.


Installation

Send an email to dcordioli [at] expertsystem.it
    • Download the plug-in setup
    • Follow the step by step instructions


 


Last Updated on Friday, 30 April 2010 14:03