Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(6)
Cost Saving
High Dimensionality
Non-negative Matrix Factorization
Optimal Policy
State Space
Value Function
Subscribe
Academic
Publications
A novel orthogonal NMF-based belief compression for POMDPs
A novel orthogonal NMF-based belief compression for POMDPs,10.1145/1273496.1273564,Xin Li,William Kwok-wai Cheung,Jiming Liu,Zhili Wu
Edit
A novel orthogonal NMF-based belief compression for POMDPs
(
Citations: 3
)
BibTex
|
RIS
|
RefWorks
Download
Xin Li
,
William Kwok-wai Cheung
,
Jiming Liu
,
Zhili Wu
High dimensionality
of POMDP's belief
state space
is one major cause that makes the underlying
optimal policy
computation in- tractable. Belief compression refers to the methodology that projects the belief
state space
to a low-dimensional one to alleviate the problem. In this paper, we propose a novel orthogonal non-negative matrix factor- ization (O-NMF) for the projection. The proposed O-NMF not only factors the be- lief
state space
by minimizing the reconstruc- tion error, but also allows the compressed POMDP formulation to be efficiently com- puted (due to its orthogonality) in a value- directed manner so that the
value function
will take same values for corresponding belief states in the original and compressed state spaces. We have tested the proposed ap- proach using a number of benchmark prob- lems and the empirical results confirms its effectiveness in achieving substantial compu- tational
cost saving
in policy computation.
Conference:
International Conference on Machine Learning - ICML
, pp. 537-544, 2007
DOI:
10.1145/1273496.1273564
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
portal.acm.org
)
(
portal.acm.org
)
(
damas.ift.ulaval.ca
)
(
doi.acm.org
)
(
www.informatik.uni-trier.de
)
(
www.machinelearning.org
)
(
www.comp.hkbu.edu.hk
)
More »
Citation Context
(1)
...To cope with large state spaces, POMDP compression methods [10,
11
] reduce the size of the state space...
Joni Pajarinen
,
et al.
Efficient Planning in Large POMDPs through Policy Graph Based Factoriz...
References
(12)
Exact and Approximate Algorithms for Partially Observable Markov Decision Processes
(
Citations: 130
)
A. R. Cassandra
Published in 1998.
Orthogonal nonnegative matrix t-factorizations for clustering
(
Citations: 104
)
Chris H. Q. Ding
,
Tao Li
,
Wei Peng
,
Haesun Park
Conference:
Knowledge Discovery and Data Mining - KDD
, pp. 126-135, 2006
Learning the parts of objects by non-negative matrix factorization
(
Citations: 1613
)
D. D. Lee
,
H. S. Seung
Journal:
Nature
, 1999
Algorithms for Nonnegative Matrix Factorization
(
Citations: 1162
)
Daniel D. Lee
,
H. Sebastian Seung
Conference:
Neural Information Processing Systems - NIPS
, vol. 13, pp. 556-562, 2000
Decomposing Large-Scale POMDP Via Belief State Analysis
(
Citations: 3
)
Xin Li
,
William K. Cheung
,
Jiming Liu
Conference:
International Agent Technology Conference - IAT
, pp. 428-434, 2005
Sort by:
Citations
(3)
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Joni Pajarinen
,
Jaakko Peltonen
,
Ari Hottinen
,
Mikko A. Uusitalo
Conference:
Principles of Data Mining and Knowledge Discovery - PKDD
, pp. 1-16, 2010
Compressing POMDPs Using Locality Preserving NonNegative Matrix Factorization
Georgios Theocharous
,
Sridhar Mahadevan
Conference:
National Conference on Artificial Intelligence - AAAI
, 2010
Predictive State Temporal Difference Learning
Byron Boots
,
Geoffrey J. Gordon
Journal:
Computing Research Repository - CORR
, vol. abs/1011.0, 2010