US Patent No: 5,519,608

Number of patents in Portfolio can not be more than 2000

Method for extracting from a text corpus answers to questions stated in natural language by using linguistic analysis and hypothesis generation

Stats

ATTORNEY / AGENT: (SPONSORED)
 

Importance

Loading Importance Indicators... loading....

Abstract

A computerized method for organizing information retrieval based on the content of a set of primary documents. The method generates answer hypotheses based on text found in the primary documents and, typically, a natural-language input string such as a question. The answer hypotheses can include phrases or words not present in the input string. Answer hypotheses are verified and ranked based on their verification evidence. A text corpus can be queried to provide verification evidence not present in the primary documents. In another aspect the method is implemented in the context of a larger two-phase method, of which the first phase comprises the method of the invention and the second phase of the method comprises answer extraction.

Loading the Abstract Image... loading....

First Claim

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
XEROX CORPORATIONSTAMFORD, CT17085

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Kupiec, Julian M Cupertino, CA 8 795

Cited Art

Patent Info (Count) # Cites Year
 
HITACHI, LTD. (2)
4,931,935 User interface system for permitting natural language interaction with an information retrieval system 67 1988
4,994,967 Information retrieval system with means for analyzing undefined words in a natural language inquiry 25 1989
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (2)
4,823,306 Text search system 134 1987
5,263,159 Information retrieval based on rank-ordered cumulative query scores calculated from weights of all keywords in an inverted index file for minimizing access to a main database 50 1990
 
HEWLETT-PACKARD COMPANY (1)
5,265,014 Multi-modal user interface 60 1992
 
TNET, INC. (1)
4,972,349 Information retrieval system and method 76 1989
 
WEST SERVICES, INC. (1)
5,265,065 Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query 287 1991
 
XEROX CORPORATION (1)
5,278,980 Iterative technique for phrase query formation and an information retrieval system employing same 245 1991

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
PHOENIX SOLUTIONS, INC. (16)
7,392,185 Speech based learning/training system using semantic decoding 32 2003
7,725,307 Query engine for processing voice based queries including semantic decoding 24 2003
7,729,904 Partial speech processing device and method for use in distributed systems 18 2004
7,702,508 System and method for natural language processing of query answers 19 2004
7,657,424 System and method for processing sentence based queries 18 2004
7,624,007 System and method for natural language processing of sentence based queries 23 2004
7,831,426 Network based interactive speech recognition system 18 2006
7,647,225 Adjustable resource based speech recognition system 16 2006
8,352,277 Method of interacting through speech with a web-connected server 0 2007
7,725,320 Internet based speech recognition system with dynamic grammars 15 2007
7,698,131 Speech recognition system for client devices having differing computing capabilities 13 2007
7,912,702 Statistical language model trained with semantic variants 13 2007
7,873,519 Natural language speech lattice containing semantic variants 20 2007
7,672,841 Method for processing speech data for a distributed recognition system 15 2008
8,229,734 Semantic decoding of user queries 0 2008
7,725,321 Speech based query system using semantic decoding 16 2008
 
GOOGLE INC. (15)
7,657,423 Automatic completion of fragments of text 7 2003
7,831,545 Identifying the unifying subject of a set of facts 3 2005
7,769,579 Learning facts from semi-structured text 1 2005
7,567,976 Merging objects in a facts database 8 2005
8,260,785 Automatic object reference identification and linking in a browseable fact repository 0 2006
8,244,689 Attribute entropy as a signal in object normalization 0 2006
7,991,797 ID persistence through normalization 0 2006
8,122,026 Finding and disambiguating references to entities on web pages 1 2006
8,347,202 Determining geographic locations for place names in a fact repository 0 2007
8,239,350 Date ambiguity resolution 0 2007
7,966,291 Fact-based object merging 3 2007
7,970,766 Entity type assignment 1 2007
8,024,178 Automatic completion of fragments of text 0 2009
8,078,573 Identifying the unifying subject of a set of facts 0 2010
8,280,722 Automatic completion of fragments of text 0 2011
 
MICROSOFT CORPORATION (11)
7,454,393 Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora 8 2003
7,519,595 Method and system for adaptive categorial presentation of search results 10 2004
7,668,791 Distinguishing facts from opinions using a multi-stage approach 1 2006
7,516,113 Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora 2 2006
8,190,627 Machine assisted query formulation 0 2007
8,346,756 Calculating valence of expressions within documents for searching a document index 0 2008
8,316,036 Checkpointing iterators during search 0 2008
8,280,721 Efficiently representing word sense probabilities 0 2008
8,229,730 Indexing role hierarchies for words in a search index 0 2008
8,229,970 Efficient storage and retrieval of posting lists 2008
8,209,321 Emphasizing search results according to conceptual meaning 0 2008
 
BUZZMETRICS, LTD., AN ISRAEL CORPORATION (10)
6,584,470 Multi-layered semiotic mechanism for answering natural language questions using document retrieval combined with information extraction 45 2001
7,725,414 Method for developing a classifier for classifying communications 5 2004
7,523,085 Topical sentiments in electronically stored communications 19 2005
8,271,316 Consumer to business data capturing system 2006
7,596,552 Method and system for extracting web data 9 2006
7,600,017 System and method for scoring electronic messages 17 2007
7,844,483 System and method for predicting external events from electronic author activity 3 2007
7,844,484 System and method for benchmarking electronic message activity 3 2007
7,877,345 Topical sentiments in electronically stored communications 4 2009
8,041,669 Topical sentiments in electronically stored communications 2 2010
 
SRI INTERNATIONAL (8)
7,069,560 Highly scalable software-based architecture for communication and cooperation among distributed electronic agents 31 1999
6,859,931 Extensible software-based architecture for communication and cooperation within and between communities of distributed agents and distributed objects 71 1999
6,691,151 Unified messaging methods and systems for communication and cooperation among distributed agents in a computing environment 142 1999
6,742,021 Navigating network-based electronic information using spoken input with multimodal error feedback 70 2000
6,513,063 Accessing network-based electronic information through scripted online interfaces using spoken input 57 2000
6,757,718 Mobile navigation of network-based electronic information using spoken input 88 2000
6,523,061 System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system 83 2000
7,036,128 Using a community of distributed electronic agents to support a highly mobile, ambient computing environment 37 2000
 
DRB LIT LTD. (5)
7,074,128 Method and system for enhancing memorization by using a mnemonic display 2 2003
RE39435 Learning system with learner-constructed response based methodology 0 2003
7,364,432 Methods of selecting Lock-In Training courses and sessions 0 2004
7,357,640 Lock-In Training system 0 2004
7,390,191 Computer system configured to sequence multi-day training utilizing a database 0 2005
 
HAPAX LIMITED (5)
6,842,730 Method and system for information extraction 28 2000
7,058,564 Method of finding answers to questions 13 2001
7,194,406 Method and system for information extraction 9 2005
7,707,023 Method of finding answers to questions 1 2006
7,657,425 Method and system for information extraction 1 2007
 
A9.COM, INC. (4)
6,973,429 Grammar generation for voice-based searches 11 2000
7,444,324 Search query processing to identify search string corrections that reflect past search query submissions of users 6 2004
7,840,577 Search query processing to identify related search terms and to correct misspellings of search terms 1 2006
7,996,398 Identifying related search terms based on search behaviors of users 0 2010
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (4)
8,340,955 System and method for finding the most likely answer to a natural language question 0 2008
8,000,957 English-language translation of exact interpretations of keyword queries 0 2008
8,140,323 Method and system for extracting information from unstructured text using symbolic machine learning 0 2009
8,417,514 System and method for finding the most likely answer to a natural language question 0 2012
 
KNAPP INVESTMENT COMPANY LIMITED (4)
6,845,370 Advanced information gathering for targeted activities 75 1998
6,134,548 System, method and article of manufacture for advanced mobile bargain shopping 303 1998
7,149,741 System, method and article of manufacture for advanced information gathering for targetted activities 25 2003
8,108,418 System, method and article of manufacture for advanced information gathering for targetted activities 1 2006
 
BROADCOM CORPORATION (3)
7,007,163 Methods and apparatus for accelerating secure session processing 6 2002
7,134,014 Methods and apparatus for accelerating secure session processing 0 2005
7,600,122 Methods and apparatus for accelerating secure session processing 0 2006
 
GO ALBERT FRANCE (3)
6,598,039 Natural language interface for searching database 51 1999
6,594,657 System and method for enhancing online support services using natural language interface for searching database 60 1999
6,446,064 System and method for enhancing e-commerce using natural language interface for searching database 32 1999
 
HASTUR LIMITED LLC (3)
6,501,937 Learning method and system based on questioning 33 1999
6,336,029 Method and system for providing information in response to questions 19 2000
6,480,698 Learning method and system based on questioning 26 2001
 
IPLEARN (3)
5,836,771 Learning method and system based on questioning 76 1996
5,884,302 System and method to answer a question 124 1997
5,934,910 Learning method and system based on questioning 63 1998
 
XEROX CORPORATION (3)
5,911,140 Method of ordering document clusters given some knowledge of user interests 25 1995
6,411,962 Systems and methods for organizing text 32 1999
7,788,084 Labeling of work of art titles in text for natural language processing 15 2006
 
ACCENTURE GLOBAL SERVICES LIMITED (2)
6,401,085 Mobile communication and computing system and method 256 1999
6,356,905 System, method and article of manufacture for mobile communication utilizing an interface support framework 270 1999
 
AOL INC. (2)
7,890,505 Filtering system for providing personalized information in the absence of negative data 0 2009
8,060,507 Filtering system for providing personalized information in the absence of negative data 0 2011
 
CONTENT ANALYST COMPANY, LLC (2)
6,678,679 Method and system for facilitating the refinement of data queries 29 2000
6,954,750 Method and system for facilitating the refinement of data queries 21 2003
 
DECERNIS, LLC (2)
8,037,018 Document validation system and method 0 2006
7,769,712 Document validation system and method 1 2007
 
FUJI XEROX CO., LTD. (2)
7,844,598 Question answering system, data search method, and computer program 0 2005
7,805,303 Question answering system, data search method, and computer program 1 2005
 
INVENTION MACHINE CORPORATION (2)
7,120,574 Synonym extension of search queries with validation 21 2001
7,962,326 Semantic answering system and method 4 2001
 
JORDAAN TECHNOLOGIES, L.L.C. (2)
7,003,509 High-dimensional data clustering with the use of hybrid similarity matrices 5 2003
7,062,508 Method and computer-based system for non-probabilistic hypothesis generation and verification 6 2003
 
JUSTSYSTEMS EVANS RESEARCH, INC. (2)
6,278,990 Sort system for text retrieval 17 1997
6,505,198 Sort system for text retrieval 1 2001
 
NIELSEN COMPANY (US), LLC, THE (2)
7,660,783 System and method of ad-hoc analysis of data 5 2007
8,347,326 Identifying key media events and modeling causal relationships between key events and reported feelings 0 2007
 
PROMPTU SYSTEMS CORPORATION (2)
8,095,370 Dual compression voice recordation non-repudiation system 0 2004
7,685,523 System and method of voice recognition near a wireline node of network supporting cable television and/or video delivery 0 2005
 
XILUNIUM CAPITAL AG, L.L.C. (2)
6,934,675 Methods and systems for enabling speech-based internet searches 17 2001
7,496,515 Methods and systems for enabling speech-based internet searches using phonemes 1 2005
 
YAHOO! INC. (2)
7,406,460 Technique for ranking records of a database 0 2004
7,809,664 Automated learning from a question and answering network of humans 3 2007
 
ADVANCED RECOGNITION TECHNOLOGIES, INC. (1)
5,982,929 Pattern recognition method and system 54 1995
 
AGENCY FOR SCIENCE, TECHNOLOGY AND RESEARCH (1)
7,346,491 Method of text similarity measurement 0 2001
 
AGILETV CORPORATION (1)
7,047,196 System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery 16 2001
 
AT&T MOBILITY II LLC (1)
8,351,581 Systems and methods for intelligent call transcription 0 2008
 
CANON KABUSHIKI KAISHA (1)
6,505,157 Apparatus and method for generating processor usable data from natural language input data 7 2000
 
CORBIS CORPORATION (1)
7,933,765 Cross-lingual information retrieval 2 2007
 
DOSSIERVIEW INC. (1)
8,060,513 Information processing with integrated semantic contexts 0 2008
 
ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE (1)
7,428,487 Semi-automatic construction method for knowledge base of encyclopedia question answering system 2 2004
 
FUJITSU LIMITED (1)
6,101,488 Intelligent information program generation and retrieval system 15 1997
 
GOVERNMENT OF THE REPUBLIC OF SINGAPORE (1)
5,930,746 Parsing and translating natural language sentences automatically 45 1996
 
IAC SEARCH & MEDIA, INC. (1)
6,584,464 Grammar template query system 129 1999
 
INFONAUTICS CORPORATION (1)
5,717,914 Method for categorizing documents into subjects using relevance normalization for documents retrieved from an information retrieval system in response to a query 115 1995
 
INTELLIGENT TEXT PROCESSING, INC. (1)
5,794,050 Natural language understanding system 265 1997
 
IP LEARN, LLC (1)
6,498,921 Method and system to answer a natural-language question 50 1999
 
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (1)
6,353,822 Program-listing appendix 43 1996
 
MD FAB. CAPITAL L.L.C. (1)
6,571,240 Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases 64 2000
 
NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (1)
7,444,279 Question answering system and question answering processing method 0 2004
 
ROBERT D. LINDNER, JR. (1)
6,865,370 Learning method and system based on questioning 9 2003
 
S.F. IP PROPERTIES 15 LLC (1)
6,185,550 Method and apparatus for classifying documents within a class hierarchy creating term vector, term file and relevance ranking 143 1997
 
SAP FRANCE (1)
8,185,509 Association of semantic objects with linguistic entity categories 0 2008
 
SAS INSTITUTE INC. (1)
7,873,657 Method and system for processing, by an information retrieval system, user input modifying the information retrieval system 3 2007
 
SUN MICROSYSTEMS, INC. (1)
6,098,066 Method and apparatus for searching for documents stored within a document directory hierarchy 79 1997
 
THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (1)
6,167,368 Method and system for indentifying significant topics of a document 61 1998
 
VODAFONE AG (1)
6,199,099 System, method and article of manufacture for a mobile communication network utilizing a distributed communication network 380 1999