com.knowledgebooks.nlp
Class AutoTagger
java.lang.Object
com.knowledgebooks.nlp.AutoTagger
public class AutoTagger
- extends java.lang.Object
Associates pre-trained classification categories (tags) with input text.
Example categories include news story types, technology category types, and
social information types.
Copyright 2002-2008 by Mark Watson. All rights reserved.
This software is not public domain. It can be legally
used under either of the following licenses:
1. KnowledgeBooks.com Non Commercial Royalty Free License
2. KnowledgeBooks.com Commercial Use License
see www.knowledgebooks.com for details
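This page does not describe how AutoTagger scores categories internally. Purely as an illustration of the general idea of tagging text with pre-trained, per-category word weights, here is a minimal sketch. The class name `TagScoringSketch`, the method `scoreTags`, the category names, and all weights are hypothetical and are not part of the KnowledgeBooks API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TagScoringSketch {
    // Hypothetical "pre-trained" model: category name -> (word -> weight).
    // A real model would be loaded from data, not hard-coded.
    static final Map<String, Map<String, Float>> MODEL = new LinkedHashMap<>();
    static {
        MODEL.put("news_politics", Map.of("election", 2.0f, "senate", 1.5f, "vote", 1.0f));
        MODEL.put("technology", Map.of("software", 2.0f, "java", 1.5f, "server", 1.0f));
    }

    /**
     * Scores each category by summing the weights of the words that occur in
     * the text. Returns only categories with a positive score, best first.
     */
    static List<Map.Entry<String, Float>> scoreTags(String text) {
        String[] words = text.toLowerCase().split("[^a-z0-9]+");
        List<Map.Entry<String, Float>> result = new ArrayList<>();
        for (Map.Entry<String, Map<String, Float>> category : MODEL.entrySet()) {
            float score = 0f;
            for (String word : words) {
                score += category.getValue().getOrDefault(word, 0f);
            }
            if (score > 0f) {
                result.add(Map.entry(category.getKey(), score));
            }
        }
        // Sort by descending score so the strongest tag comes first.
        result.sort((a, b) -> Float.compare(b.getValue(), a.getValue()));
        return result;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, Float> tag : scoreTags("The senate vote delayed the software bill")) {
            System.out.println(tag.getKey() + " " + tag.getValue());
        }
    }
}
```

The sketch works on raw lowercased words for brevity. The actual class also accepts a list of stems, so a real pipeline would presumably stem the words before looking them up.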
Method Summary
static java.util.List<SFtriple> getTags(java.util.List<java.lang.String> stems)
static java.util.List<SFtriple> getTags(java.lang.String text)
static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems)
static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems, java.util.List<SFtriple> best_tags)
- Find the words that are most important for determining tags and use
this information to find which words in input text are most important for
summarization, semantic understanding, etc.
static float[] getWordImportanceWeights(java.lang.String text)
static void initializeAutoTagger(java.io.InputStream xml_input_stream)
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
AutoTagger
public AutoTagger()
initializeAutoTagger
public static void initializeAutoTagger(java.io.InputStream xml_input_stream)
- Parameters:
xml_input_stream -
getTags
public static java.util.List<SFtriple> getTags(java.lang.String text)
- Parameters:
text -
- Returns:
getTags
public static java.util.List<SFtriple> getTags(java.util.List<java.lang.String> stems)
- Parameters:
stems -
- Returns:
getWordImportanceWeights
public static float[] getWordImportanceWeights(java.lang.String text)
- Parameters:
text -
- Returns:
getWordImportanceWeights
public static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems)
- Parameters:
stems -
- Returns:
getWordImportanceWeights
public static float[] getWordImportanceWeights(java.util.List<java.lang.String> stems,
java.util.List<SFtriple> best_tags)
- Find the words that are most important for determining tags and use
this information to find which words in input text are most important for
summarization, semantic understanding, etc.
- Parameters:
stems - stems for words in text
best_tags - the best tags for this text
- Returns:
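The description of the two-argument getWordImportanceWeights suggests a simple idea: a word matters more when it contributes strongly to the tags chosen for the text. The following is a minimal sketch of that idea, not the library's implementation. It assumes one weight per input stem, normalized so the strongest word gets 1.0. The class `WordImportanceSketch`, the method `wordImportance`, the model contents, and the sample stems are all hypothetical. Tags are passed as plain category names because the fields of SFtriple are not documented on this page.

```java
import java.util.List;
import java.util.Map;

public class WordImportanceSketch {
    // Hypothetical model: category name -> (stem -> weight).
    static final Map<String, Map<String, Float>> MODEL = Map.of(
        "news_politics", Map.of("senat", 1.5f, "vote", 1.0f),
        "technology", Map.of("softwar", 2.0f, "server", 1.0f));

    /**
     * Returns one weight per stem: the sum of that stem's weights across the
     * given best tags, divided by the largest such sum. All weights are 0
     * when no stem contributes to any of the tags.
     */
    static float[] wordImportance(List<String> stems, List<String> bestTags) {
        float[] weights = new float[stems.size()];
        float max = 0f;
        for (int i = 0; i < stems.size(); i++) {
            for (String tag : bestTags) {
                Map<String, Float> tagWeights = MODEL.getOrDefault(tag, Map.of());
                weights[i] += tagWeights.getOrDefault(stems.get(i), 0f);
            }
            max = Math.max(max, weights[i]);
        }
        if (max > 0f) {
            // Normalize so the most important stem has weight 1.0.
            for (int i = 0; i < weights.length; i++) {
                weights[i] /= max;
            }
        }
        return weights;
    }

    public static void main(String[] args) {
        float[] w = wordImportance(
            List.of("the", "senat", "vote", "softwar"), List.of("news_politics"));
        for (float value : w) {
            System.out.println(value);
        }
    }
}
```

With only "news_politics" as the best tag, the stem "softwar" gets weight 0 even though it is a strong technology word. That is the point of restricting the calculation to the best tags: words are ranked by their relevance to what the text was actually classified as.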