Building a translation lexicon from comparable, non-parallel corpora

United States of America Patent

PATENT NO 8234106
APP PUB NO 20100042398A1
SERIAL NO 12576110

Abstract

A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
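The seed-lexicon step in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The function names (`word_counts`, `build_seed_lexicon`, `frequency_similarity`), the whitespace tokenization, the `min_length` filter, and the min/max frequency ratio are all assumptions made for illustration only.

```python
from collections import Counter


def word_counts(sentences):
    """Count lowercase whitespace-separated tokens (assumed tokenization)."""
    return Counter(tok.lower() for s in sentences for tok in s.split())


def build_seed_lexicon(corpus_a, corpus_b, min_length=4):
    """Map each word spelled identically in both corpora to itself.

    min_length is a hypothetical filter: short identical strings are more
    likely to be coincidental matches than true translations.
    """
    counts_a = word_counts(corpus_a)
    counts_b = word_counts(corpus_b)
    return {
        w: w
        for w in counts_a
        if w in counts_b and len(w) >= min_length and w.isalpha()
    }


def frequency_similarity(word_a, word_b, counts_a, counts_b):
    """Score a candidate pair by how close their relative frequencies are.

    Returns a value in [0, 1]; 1 means identical relative frequency.
    This ratio is an illustrative stand-in for the frequency clue.
    """
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    if total_a == 0 or total_b == 0:
        return 0.0
    freq_a = counts_a[word_a] / total_a
    freq_b = counts_b[word_b] / total_b
    if freq_a == 0 or freq_b == 0:
        return 0.0
    return min(freq_a, freq_b) / max(freq_a, freq_b)
```

In this sketch the seed lexicon is built first, and a score such as `frequency_similarity` would then be one of several clues used to rank other candidate pairs. A context-based clue would need its own scoring function and is not shown here.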

Patent Owner(s)

Patent Owner: UNIVERSITY OF SOUTHERN CALIFORNIA
Address: 1150 SOUTH OLIVE STREET SUITE 2300 LOS ANGELES CA 90015

Inventor(s)

Inventor Name             Address             # of Filed Patents   Total Citations
Knight, Kevin             Hermosa Beach, US   40                   2270
Koehn, Philipp            Venice, US          7                    732
Marcu, Daniel             Hermosa Beach, US   37                   2979
Munteanu, Dragos Stefan   Los Angeles, US     6                    377
