com.knowledgebooks.nlp
Class AutoTagger

java.lang.Object
  extended by com.knowledgebooks.nlp.AutoTagger

public class AutoTagger
extends java.lang.Object

Associates pre-trained classification categories (tags) with input text. Categories include news story types, technology category types, social information types, etc.
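
The idea of associating pre-trained categories with text can be sketched as follows. This is a minimal illustration, assuming each category is stored as a table of word stems with weights and that a category's score is the sum of the weights of the stems found in the input. The class `TagScoringSketch`, the `CATEGORY_WEIGHTS` table, and all of its values are hypothetical. The source does not describe how AutoTagger scores text or what the XML data passed to `initializeAutoTagger` contains, so this is not the library's implementation.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is not the AutoTagger implementation.
class TagScoringSketch {

    // Assumed "pre-trained" data: category -> (stem -> weight). All values are made up.
    static final Map<String, Map<String, Float>> CATEGORY_WEIGHTS = new LinkedHashMap<>();
    static {
        Map<String, Float> technology = new LinkedHashMap<>();
        technology.put("comput", 2.0f);
        technology.put("softwar", 1.5f);
        CATEGORY_WEIGHTS.put("technology", technology);

        Map<String, Float> sports = new LinkedHashMap<>();
        sports.put("team", 1.0f);
        sports.put("score", 1.5f);
        CATEGORY_WEIGHTS.put("sports", sports);
    }

    // Sums the weights of the input stems under each category and returns the
    // categories with a positive score, ordered from highest to lowest score.
    static List<Map.Entry<String, Float>> scoreTags(List<String> stems) {
        List<Map.Entry<String, Float>> result = new ArrayList<>();
        for (Map.Entry<String, Map<String, Float>> category : CATEGORY_WEIGHTS.entrySet()) {
            float score = 0.0f;
            for (String stem : stems) {
                score += category.getValue().getOrDefault(stem, 0.0f);
            }
            if (score > 0.0f) {
                result.add(new AbstractMap.SimpleEntry<String, Float>(category.getKey(), score));
            }
        }
        result.sort((a, b) -> Float.compare(b.getValue(), a.getValue()));
        return result;
    }

    public static void main(String[] args) {
        List<String> stems = Arrays.asList("new", "softwar", "for", "comput", "team");
        for (Map.Entry<String, Float> tag : scoreTags(stems)) {
            System.out.println(tag.getKey() + " " + tag.getValue());
        }
    }
}
```

The sketch returns plain map entries because the fields of `SFtriple` are not documented on this page.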

Copyright 2002-2008 by Mark Watson. All rights reserved.

This software is not public domain. It can be legally used under either of the following licenses:

1. KnowledgeBooks.com Non Commercial Royalty Free License
2. KnowledgeBooks.com Commercial Use License

See www.knowledgebooks.com for details.


Constructor Summary
AutoTagger()
           
 
Method Summary
static java.util.List<SFtriple> getTags(java.util.List<java.lang.String> stems)
           
static java.util.List<SFtriple> getTags(java.lang.String text)
           
static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems)
           
static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems, java.util.List<SFtriple> best_tags)
          Find the words that are most important for determining tags and use this information to find which words in input text are most important for summarization, semantic understanding, etc.
static float[] getWordImportanceWeights(java.lang.String text)
           
static void initializeAutoTagger(java.io.InputStream xml_input_stream)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

AutoTagger

public AutoTagger()

Method Detail

initializeAutoTagger

public static void initializeAutoTagger(java.io.InputStream xml_input_stream)
Parameters:
xml_input_stream - input stream supplying the XML data used to initialize the tagger

getTags

public static java.util.List<SFtriple> getTags(java.lang.String text)
Parameters:
text - input text to tag
Returns:

getTags

public static java.util.List<SFtriple> getTags(java.util.List<java.lang.String> stems)
Parameters:
stems - stems for words in text
Returns:

getWordImportanceWeights

public static float[] getWordImportanceWeights(java.lang.String text)
Parameters:
text - input text
Returns:

getWordImportanceWeights

public static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems)
Parameters:
stems - stems for words in text
Returns:

getWordImportanceWeights

public static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems,
                                               java.util.List<SFtriple> best_tags)
Find the words that are most important for determining tags and use this information to find which words in input text are most important for summarization, semantic understanding, etc.

Parameters:
stems - stems for words in text
best_tags - the best tags for this text
Returns:
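
The description above can be illustrated with a minimal sketch, assuming that each tag has a table of stem weights and that a stem's importance is the sum of its weights under the best tags for the text. The class `WordImportanceSketch`, the method `wordImportanceWeights`, the `TAG_STEM_WEIGHTS` table, and its values are hypothetical. Tags are passed as plain strings here because the fields of `SFtriple` are not documented on this page, and the actual weighting used by AutoTagger may differ.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is not the AutoTagger implementation.
class WordImportanceSketch {

    // Assumed per-tag stem weights (made-up values standing in for pre-trained data).
    static final Map<String, Map<String, Float>> TAG_STEM_WEIGHTS = new HashMap<>();
    static {
        Map<String, Float> technology = new HashMap<>();
        technology.put("comput", 2.0f);
        technology.put("softwar", 1.5f);
        TAG_STEM_WEIGHTS.put("technology", technology);

        Map<String, Float> sports = new HashMap<>();
        sports.put("team", 1.0f);
        TAG_STEM_WEIGHTS.put("sports", sports);
    }

    // Returns one weight per stem, in the same order as the input list.
    // A stem's weight is the sum of its weights under each of the given best tags;
    // stems that do not contribute to any best tag get a weight of 0.
    static float[] wordImportanceWeights(List<String> stems, List<String> bestTags) {
        float[] weights = new float[stems.size()];
        for (int i = 0; i < stems.size(); i++) {
            for (String tag : bestTags) {
                Map<String, Float> stemWeights = TAG_STEM_WEIGHTS.get(tag);
                if (stemWeights != null) {
                    weights[i] += stemWeights.getOrDefault(stems.get(i), 0.0f);
                }
            }
        }
        return weights;
    }

    public static void main(String[] args) {
        List<String> stems = Arrays.asList("the", "softwar", "team", "comput");
        List<String> bestTags = Arrays.asList("technology");
        System.out.println(Arrays.toString(wordImportanceWeights(stems, bestTags)));
    }
}
```

In this sketch, words with the highest weights are the ones that most influenced the chosen tags, which is the information the method description suggests using for summarization.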