EXPERIENCE SELECTION IN REINFORCEMENT LEARNING

United States of America

APP PUB NO 20250013871A1
SERIAL NO 18895583

Abstract

Techniques described herein include selecting experience data for use when training or retraining a model. In one example, this disclosure describes a method that includes generating a plurality of trajectories, each comprising a contiguous sequence of instances of experience data, where each instance of experience data in the contiguous sequence has an error value associated with that instance of experience data; determining, for each of the trajectories, a sorted order of the instances of experience data, wherein the sorted order is based on the error value associated with each of the instances of experience data; selecting, based on a distribution function applied to the sorted order of the instances of experience data in at least one of the trajectories, a subset of instances of the experience data; and retraining a reinforcement learning model, using the subset of instances of experience data, to predict an optimal action to take in a state.
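
The selection steps recited in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the application: the `Experience` fields, the fixed trajectory length, the rank-based weighting in `rank_weights` (one possible "distribution function"), sorting by largest absolute error first, and sampling with replacement. The retraining step itself is not shown.

```python
import random
from dataclasses import dataclass


@dataclass
class Experience:
    """One instance of experience data (field names are hypothetical)."""
    state: int
    action: int
    reward: float
    next_state: int
    error: float  # error value attached to this instance, e.g. a TD error (assumed)


def make_trajectories(experiences, length):
    """Split a stream of experience into contiguous, non-overlapping trajectories."""
    return [experiences[i:i + length] for i in range(0, len(experiences), length)]


def rank_weights(n, alpha=1.0):
    """Hypothetical distribution function over sorted positions.

    The instance at rank r (0 = first in the sorted order) gets weight
    1 / (r + 1) ** alpha, so earlier ranks are sampled more often.
    """
    return [1.0 / (rank + 1) ** alpha for rank in range(n)]


def select_subset(trajectories, per_trajectory, alpha=1.0, rng=None):
    """Sort each trajectory by error, then sample instances by rank weight."""
    if rng is None:
        rng = random.Random()
    subset = []
    for trajectory in trajectories:
        # Sorted order based on the error value (largest magnitude first, assumed).
        ordered = sorted(trajectory, key=lambda e: abs(e.error), reverse=True)
        weights = rank_weights(len(ordered), alpha)
        # Weighted sampling with replacement, so an instance may repeat.
        subset.extend(rng.choices(ordered, weights=weights, k=per_trajectory))
    return subset


if __name__ == "__main__":
    # Toy data: 12 instances whose error grows with the index.
    data = [Experience(i, 0, 0.0, i + 1, float(i)) for i in range(12)]
    trajectories = make_trajectories(data, length=4)
    batch = select_subset(trajectories, per_trajectory=2, rng=random.Random(0))
    print(len(trajectories), len(batch))
```

In a full pipeline, the returned subset would be passed to whatever update routine retrains the reinforcement learning model; a larger `alpha` concentrates sampling on the highest-error instances, while `alpha=0` reduces to uniform sampling within each trajectory.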

Patent Owner(s)

Patent Owner | Address
WELLS FARGO BANK N A | 1525 W WT HARRIS BLVD MAC D1109-109 ATTN AGENCY SERVICES CHARLOTTE NC 28262

Inventor(s)

Inventor Name | Address | # of Filed Patents | Total Citations
White, Jacob | Frisco, US | 18 | 197
