Shlomo Zilberstein

Professor, Associate Dean for Research and Engagement

shlomo@cs.umass.edu

Phone: (413) 545-4189

A359 LGRC

Research

We study a wide range of problems in artificial intelligence, automated planning and learning, autonomous systems, reasoning under uncertainty, multi-agent systems, and resource-bounded reasoning. We are particularly interested in the implications of uncertainty and limited computational resources on the design of autonomous agents. In most practical settings, it is not feasible or desirable to find the optimal action, making it necessary to resort to some form of approximate reasoning. This raises a fundamental question: what does it mean for an agent to be “rational” when it does not have enough knowledge or computational power to derive the best course of action? Our overall approach to this problem involves meta-level control mechanisms that reason explicitly about the cost of decision-making and can optimize the amount of deliberation (or “thinking”) an agent does before taking action. We have also developed new planning techniques for situations involving multiple decision makers operating in either collaborative or adversarial domains.

Human Compatible AI

How can we design AI systems that are compatible with human needs: accountable, explainable, equitable, ethical, and mindful of human cognitive biases and shortcomings?

Show Related Publications

Miura, Shuwa; Zilberstein, Shlomo

Observer-Aware Planning with Implicit and Explicit Communication Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Vazquez-Chanlatte, Marcell; Witwicki, Stefan; Zilberstein, Shlomo

Explaining the Behavior of POMDP-based Agents Through the Impact of Counterfactual Information Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 37th International FLAIRS Conference, Miramar Beach, Florida, 2024.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Nashed, Samer B.; Goldman, Claudia V.; Zilberstein, Shlomo

Estimating Causal Responsibility for Explaining Autonomous Behavior Book Section

In: Calvaresi, Davide (Ed.): International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS), pp. 78–94, Springer, 2023.

Abstract | Links | BibTeX

@incollection{SZ:MNGZextraamas23,

title = {Estimating Causal Responsibility for Explaining Autonomous Behavior},

author = {Saaduddin Mahmud and Samer B. Nashed and Claudia V. Goldman and Shlomo Zilberstein},

editor = {Davide Calvaresi},

url = {http://rbr.cs.umass.edu/shlomo/papers/MNGZextraamas23.pdf},

doi = {10.1007/978-3-031-40878-6},

year  = {2023},

date = {2023-01-01},

booktitle = {International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS)},

pages = {78–94},

publisher = {Springer},

abstract = {There has been growing interest in causal explanations of stochastic, sequential decision-making systems. Structural causal models and causal reasoning offer several theoretical benefits when exact inference can be applied. Furthermore, users overwhelmingly prefer the resulting causal explanations over other state-of-the-art systems. In this work, we focus on one such method, MeanRESP, and its approximate versions that drastically reduce compute load and assign a responsibility score to each variable, which helps identify smaller sets of causes to be used as explanations. However, this method, and its approximate versions in particular, lack deeper theoretical analysis and broader empirical tests. To address these shortcomings, we provide three primary contributions. First, we offer several theoretical insights on the sample complexity and error rate of approximate MeanRESP. Second, we discuss several automated metrics for comparing explanations generated from approximate methods to those generated via exact methods. While we recognize the significance of user studies as the gold standard for evaluating explanations, our aim is to leverage the proposed metrics to systematically compare explanation-generation methods along important quantitative dimensions. Finally, we provide a more detailed discussion of MeanRESP and how its output under different definitions of responsibility compares to existing widely adopted methods that use Shapley values.},

keywords = {},

pubstate = {published},

tppubtype = {incollection}

}

Saisubramanian, Sandhya; Zilberstein, Shlomo; Kamar, Ece

Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems Journal Article

In: AI Magazine, vol. 42, no. 4, pp. 62–71, 2022.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo; Kamar, Ece

Avoiding Negative Side Effects of Autonomous Systems in the Open World Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 74, pp. 143–177, 2022.

Abstract | Links | BibTeX

@article{SZ:SZKjair22,

title = {Avoiding Negative Side Effects of Autonomous Systems in the Open World},

author = {Sandhya Saisubramanian and Shlomo Zilberstein and Ece Kamar},

url = {https://www.jair.org/index.php/jair/article/view/13581/26799},

doi = {10.1613/jair.1.13581},

year  = {2022},

date = {2022-01-01},

urldate = {2022-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {74},

pages = {143--177},

abstract = {Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.

Miura, Shuwa; Wray, Kyle Hollins; Zilberstein, Shlomo

Heuristic Search for SSPs with Lexicographic Preferences over Multiple Costs Conference

Proceedings of the 15th Annual Symposium on Combinatorial Search (SOCS), Vienna, Austria, 2022.

Abstract | Links | BibTeX

Svegliato, Justin; Nashed, Samer B; Zilberstein, Shlomo

Ethically Compliant Sequential Decision Making Conference

Proceedings of the 35th Conference on Artificial Intelligence (AAAI), 2021, (Distinguished Paper Award).

Abstract | Links | BibTeX

Miura, Shuwa; Cohen, Andrew L; Zilberstein, Shlomo

Maximizing Legibility in Stochastic Environments Conference

Proceedings of the 30th IEEE International Conference on Robot & Human Interactive Communication, (RO-MAN), Vancouver, BC, Canada, 2021.

Abstract | Links | BibTeX

Miura, Shuwa; Zilberstein, Shlomo

A Unifying Framework for Observer-Aware Planning and its Complexity Conference

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (UAI), Virtual Event, 2021.

Abstract | Links | BibTeX

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning via Introspective Perception Journal Article

In: CoRR, vol. abs/2109.13974, 2021.

Abstract | Links | BibTeX

Nashed, Samer B; Svegliato, Justin; Zilberstein, Shlomo

Ethically Compliant Planning within Moral Communities Conference

Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021.

Abstract | Links | BibTeX

Galhotra, Sainyam; Saisubramanian, Sandhya; Zilberstein, Shlomo

Learning to Generate Fair Clusters from Demonstrations Conference

Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021.

Abstract | Links | BibTeX

Galhotra, Sainyam; Saisubramanian, Sandhya; Zilberstein, Shlomo

Learning to Generate Fair Clusters from Demonstrations Journal Article

In: CoRR, vol. abs/2102.03977, 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Mitigating Negative Side Effects via Environment Shaping Journal Article

In: CoRR, vol. abs/2102.07017, 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Mitigating Negative Side Effects via Environment Shaping (Extended Abstract) Conference

Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Roberts, Shannon C; Zilberstein, Shlomo

Understanding User Attitudes Towards Negative Side Effects of AI Systems Conference

CHI Conference on Human Factors in Computing Systems, Late-Breaking Work, 2021.

Abstract | Links | BibTeX

Woolf, Beverly; Ghosh, Aritra; Lan, Andrew; Zilberstein, Shlomo; Juravich, Tom; Cohen, Andrew; Geho, Olivia

AI-Enabled Training in Manufacturing Workforce Development Conference

AAAI Spring Symposium on Artificial Intelligence in Manufacturing, 2020.

Abstract | BibTeX

Renski, Henry; Smith-Doerr, Laurel; Wilkerson, Tiamba; Roberts, Shannon C; Zilberstein, Shlomo; Branch, Enobong H

Racial Equity and the Future of Work Journal Article

In: Technology| Architecture+ Design, vol. 4, no. 1, pp. 17–22, 2020.

Anytime Algorithms

How can we design “well behaved” algorithms that can be interrupted at any time and still return useful results, and how can we use such algorithms as components of a complex AI system?

Show Related Publications

Bhatia, Abhinav; Svegliato, Justin; Nashed, Samer B.; Zilberstein, Shlomo

Tuning the Hyperparameters of Anytime Planning: A Metareasoning Approach with Deep Reinforcement Learning Conference

Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS), Virtual Conference, 2022.

Abstract | Links | BibTeX

Bhatia, Abhinav; Svegliato, Justin; Zilberstein, Shlomo

On the Benefits of Randomly Adjusting Anytime Weighted A* Conference

Proceedings of the 14th International Symposium on Combinatorial Search (SOCS), 2021.

Abstract | Links | BibTeX

Svegliato, Justin; Wray, Kyle Hollins; Zilberstein, Shlomo

Meta-Level Control of Anytime Algorithms with Online Performance Prediction Conference

Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 2018.

Abstract | Links | BibTeX

Svegliato, Justin; Zilberstein, Shlomo

Adaptive Metareasoning for Bounded Rational Agents Conference

IJCAI/ECAI Workshop on Architectures and Evaluation for Generality, Autonomy and Progress in AI (AEGAP), Stockholm, Sweden, 2018.

Abstract | Links | BibTeX

Arnt, Andrew; Zilberstein, Shlomo; Allan, James; Mouaddib, Abdel-Illah

Dynamic Composition of Information Retrieval Techniques Journal Article

In: Journal of Intelligent Information Systems (JIIS), vol. 23, no. 1, pp. 67–97, 2004.

Abstract | Links | BibTeX

Zilberstein, Shlomo; Charpillet, Francois; Chassaing, Philippe

Optimal Sequencing of Contract Algorithms Journal Article

In: Annals of Mathematics and Artificial Intelligence (AMAI), vol. 39, no. 1-2, pp. 1-18, 2003.

Abstract | Links | BibTeX

Bernstein, Daniel S; Finkelstein, Lev; Zilberstein, Shlomo

Contract Algorithms and Robots on Rays: Unifying Two Scheduling Problems Conference

Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI), Acapulco, Mexico, 2003.

Abstract | Links | BibTeX

Bernstein, Daniel S; Perkins, Theodore J; Zilberstein, Shlomo; Finkelstein, Lev

Scheduling Contract Algorithms on Multiple Processors Conference

Proceedings of the 18th National Conference on Artificial Intelligence (AAAI), Edmonton, Alberta, 2002.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

Monitoring and Control of Anytime Algorithms: A Dynamic Programming Approach Journal Article

In: Artificial Intelligence (AIJ), vol. 126, no. 1-2, pp. 139–157, 2001.

Abstract | Links | BibTeX

Grass, Joshua; Zilberstein, Shlomo

A Value-Driven System for Autonomous Information Gathering Journal Article

In: Journal of Intelligent Information Systems (JIIS), vol. 14, no. 1, pp. 5–27, 2000.

Abstract | Links | BibTeX

Zilberstein, Shlomo; Charpillet, Francois; Chassaing, Philippe

Real-Time Problem-Solving with Contract Algorithms Conference

Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, 1999.

Abstract | Links | BibTeX

Marengoni, Mauricio; Hanson, Allen; Zilberstein, Shlomo; Riseman, Edward

Control in a 3D Reconstruction System Using Selective Perception Conference

Proceedings of the 7th IEEE International Conference on Computer Vision (ICCV), Kerkyra, Greece, 1999.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo; Danilchenko, Victor A

Anytime Heuristic Search: First Results Technical Report

Computer Science Department, University of Massachussetts Amherst no. 97-50, 1997.

Abstract | Links | BibTeX

Zilberstein, Shlomo; Russell, Stuart J

Optimal Composition of Real-Time Systems Journal Article

In: Artificial Intelligence (AIJ), vol. 82, no. 1-2, pp. 181–213, 1996.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Using Anytime Algorithms in Intelligent Systems Journal Article

In: AI Magazine, vol. 17, no. 3, pp. 73–83, 1996.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

Monitoring the Progress of Anytime Problem-Solving Conference

Proceedings of the 13th National Conference on Artificial Intelligence (AAAI), Portland, Oregon, 1996.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Optimizing Decision Quality with Contract Algorithms Conference

Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI), Montreal, Canada, 1995.

Abstract | Links | BibTeX

Mouaddib, Abdel-illah; Zilberstein, Shlomo

Knowledge-Based Anytime Computation Conference

Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI), Montreal, Canada, 1995.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Operational Rationality through Compilation of Anytime Algorithms PhD Thesis

Computer Science Division, University of California Berkeley, 1993.

Abstract | Links | BibTeX

@phdthesis{SZ:Zshort93,

title = {Operational Rationality through Compilation of Anytime Algorithms},

author = {Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/Zshort93.pdf},

year = {1993},

date = {1993-01-01},

school = {Computer Science Division, University of California Berkeley},

abstract = {An important and largely ignored aspect of real-time decision making is the capability of agents to factor the cost of deliberation into the decision making process. I have developed an efficient model that creates this capability. The model uses as basic components anytime algorithms whose quality of results improves gradually as computation time increases. The main contribution of this work is a compilation process that extends the property of gradual improvement from the level of single algorithms to the level of complex systems.

In standard algorithms, the fixed quality of the output allows for composition to be implemented by a simple call-return mechanism. However, when algorithms have resource allocation as a degree of freedom, there arises the question of how to construct, for example, the optimal composition of two anytime algorithms, one of which feeds its output to the other. This scheduling problem is solved by an off-line compilation process and a run-time monitoring component that together generate a utility maximizing behavior. The crucial meta-level knowledge is kept in the anytime library in the form of conditional performance profiles. These profiles characterize the performance of each elementary anytime algorithm as a function of run-time and input quality. The compilation process therefore extends the principles of procedural abstraction and modularity to anytime computation. Its efficiency is significantly improved by using local compilation that works on a single program structure at a time. Local compilation is proved to yield global optimality for a large set of program structures.

Compilation produces contract algorithms which require the determination of the total run-time when activated. Some real-time domains require interruptible algorithms whose total run-time is unknown in advance. An important result of this work is a general method by which an interruptible algorithm can be constructed once a contract algorithm is compiled. Finally, the notion of gradual improvement of quality is extended to sensing and plan execution and the application of the model is demonstrated through a simulated robot navigation system. The result is a modular approach for developing real-time agents that act by performing anytime actions and make decisions using anytime computation.},

keywords = {},

pubstate = {published},

tppubtype = {phdthesis}

}

An important and largely ignored aspect of real-time decision making is the capability of agents to factor the cost of deliberation into the decision making process. I have developed an efficient model that creates this capability. The model uses as basic components anytime algorithms whose quality of results improves gradually as computation time increases. The main contribution of this work is a compilation process that extends the property of gradual improvement from the level of single algorithms to the level of complex systems.
In standard algorithms, the fixed quality of the output allows for composition to be implemented by a simple call-return mechanism. However, when algorithms have resource allocation as a degree of freedom, there arises the question of how to construct, for example, the optimal composition of two anytime algorithms, one of which feeds its output to the other. This scheduling problem is solved by an off-line compilation process and a run-time monitoring component that together generate a utility maximizing behavior. The crucial meta-level knowledge is kept in the anytime library in the form of conditional performance profiles. These profiles characterize the performance of each elementary anytime algorithm as a function of run-time and input quality. The compilation process therefore extends the principles of procedural abstraction and modularity to anytime computation. Its efficiency is significantly improved by using local compilation that works on a single program structure at a time. Local compilation is proved to yield global optimality for a large set of program structures.
Compilation produces contract algorithms which require the determination of the total run-time when activated. Some real-time domains require interruptible algorithms whose total run-time is unknown in advance. An important result of this work is a general method by which an interruptible algorithm can be constructed once a contract algorithm is compiled. Finally, the notion of gradual improvement of quality is extended to sensing and plan execution and the application of the model is demonstrated through a simulated robot navigation system. The result is a modular approach for developing real-time agents that act by performing anytime actions and make decisions using anytime computation.

Zilberstein, Shlomo; Russell, Stuart J

Anytime Sensing, Planning and Action: A Practical Model for Robot Control Conference

Proceedings of the 13th International Joint Conference on Artificial Intelligence (IJCAI), Chambery, France, 1993.

Abstract | Links | BibTeX

Russell, Stuart J; Zilberstein, Shlomo

Composing Real-Time Systems Conference

Proceedings of the 12th International Joint Conference on Artificial Intelligence (IJCAI), Sydney, Australia, 1991.

Abstract | Links | BibTeX

Models of Bounded Rationality

What does it mean for an agent to be “rational” when it does not have enough knowledge or computational power to derive the best course of action?

Show Related Publications

Svegliato, Justin; Basich, Connor; Saisubramanian, Sandhya; Zilberstein, Shlomo

Metareasoning for Safe Decision Making in Autonomous Systems Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Philadelphia, Pennsylvania, 2022.

Abstract | Links | BibTeX

Bhatia, Abhinav; Svegliato, Justin; Nashed, Samer B.; Zilberstein, Shlomo

Tuning the Hyperparameters of Anytime Planning: A Metareasoning Approach with Deep Reinforcement Learning Conference

Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS), Virtual Conference, 2022.

Abstract | Links | BibTeX

Svegliato, Justin; Sharma, Prakhar; Zilberstein, Shlomo

A Model-Free Approach to Meta-Level Control of Anytime Algorithms Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 2020.

Abstract | Links | BibTeX

Carlin, Alan; Zilberstein, Shlomo

Decentralized Monitoring of Distributed Anytime Algorithms Conference

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Taipei, Taiwan, 2011.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Metareasoning and Bounded Rationality Book Section

In: Cox, M; Raja, A (Ed.): Metareasoning: Thinking about Thinking, pp. 27–40, MIT Press, Cambridge, MA, USA, 2011.

Carling, Alan; Zilberstein, Shlomo

Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning Book Section

In: Guy, T; Karny, M; Wolpert, D (Ed.): Decision Making with Imperfect Decision Makers, pp. 1–28, Springer, Berlin, Heidelberg, 2011.

Petrik, Marek; Zilberstein, Shlomo

Learning Parallel Portfolios of Algorithms Journal Article

In: Annals of Mathematics and Artificial Intelligence (AMAI), vol. 48, no. 1-2, pp. 85–106, 2006.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Learning Static Parallel Portfolios of Algorithms Conference

Proceedings of the 9th International Symposium on Artificial Intelligence and Mathematics (ISAIM), Ft. Lauderdale, Florida, 2006.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

Monitoring and Control of Anytime Algorithms: A Dynamic Programming Approach Journal Article

In: Artificial Intelligence (AIJ), vol. 126, no. 1-2, pp. 139–157, 2001.

Abstract | Links | BibTeX

Cardon, Stephane; Mouaddib, Abdel-Illah; Zilberstein, Shlomo; Washington, Richard

Adaptive Control of Acyclic Progressive Processing Task Structures Conference

Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), Seattle, Washington, 2001.

Abstract | Links | BibTeX

Zilberstein, Shlomo; Mouaddib, Abdel-Illah

Optimal Scheduling of Progressive Processing Tasks Journal Article

In: International Journal of Approximate Reasoning (IJAR), vol. 25, no. 3, pp. 169–186, 2000.

Abstract | Links | BibTeX

Zilberstein, Shlomo; Mouaddib, Abdel-Illah

Reactive Control of Dynamic Progressive Processing Conference

Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, 1999.

Abstract | Links | BibTeX

Mouaddib, Abdel-Illah; Zilberstein, Shlomo

Optimal Scheduling of Dynamic Progressive Processing Conference

Proceedings of the 13th European Conference on Artificial Intelligence (ECAI), Brighton, UK, 1998, (Best Paper Award).

Abstract | Links | BibTeX

Mouaddib, Abdel-Illah; Zilberstein, Shlomo

Handling Duration Uncertainty in Meta-Level Control of Progressive Processing Conference

Proceedings of the 15th International Joint Conference on Artificial Intelligence (IJCAI), Nagoya, Japan, 1997.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Resource-Bounded Sensing and Planning in Autonomous Systems Journal Article

In: Autonomous Robots, vol. 3, pp. 31–48, 1996.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

Monitoring the Progress of Anytime Problem-Solving Conference

Proceedings of the 13th National Conference on Artificial Intelligence (AAAI), Portland, Oregon, 1996.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Optimizing Decision Quality with Contract Algorithms Conference

Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI), Montreal, Canada, 1995.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Operational Rationality through Compilation of Anytime Algorithms PhD Thesis

Computer Science Division, University of California Berkeley, 1993.

Abstract | Links | BibTeX

@phdthesis{SZ:Zshort93,

title = {Operational Rationality through Compilation of Anytime Algorithms},

author = {Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/Zshort93.pdf},

year = {1993},

date = {1993-01-01},

school = {Computer Science Division, University of California Berkeley},

abstract = {An important and largely ignored aspect of real-time decision making is the capability of agents to factor the cost of deliberation into the decision making process. I have developed an efficient model that creates this capability. The model uses as basic components anytime algorithms whose quality of results improves gradually as computation time increases. The main contribution of this work is a compilation process that extends the property of gradual improvement from the level of single algorithms to the level of complex systems.

In standard algorithms, the fixed quality of the output allows for composition to be implemented by a simple call-return mechanism. However, when algorithms have resource allocation as a degree of freedom, there arises the question of how to construct, for example, the optimal composition of two anytime algorithms, one of which feeds its output to the other. This scheduling problem is solved by an off-line compilation process and a run-time monitoring component that together generate a utility maximizing behavior. The crucial meta-level knowledge is kept in the anytime library in the form of conditional performance profiles. These profiles characterize the performance of each elementary anytime algorithm as a function of run-time and input quality. The compilation process therefore extends the principles of procedural abstraction and modularity to anytime computation. Its efficiency is significantly improved by using local compilation that works on a single program structure at a time. Local compilation is proved to yield global optimality for a large set of program structures.

Compilation produces contract algorithms which require the determination of the total run-time when activated. Some real-time domains require interruptible algorithms whose total run-time is unknown in advance. An important result of this work is a general method by which an interruptible algorithm can be constructed once a contract algorithm is compiled. Finally, the notion of gradual improvement of quality is extended to sensing and plan execution and the application of the model is demonstrated through a simulated robot navigation system. The result is a modular approach for developing real-time agents that act by performing anytime actions and make decisions using anytime computation.},

keywords = {},

pubstate = {published},

tppubtype = {phdthesis}

}

An important and largely ignored aspect of real-time decision making is the capability of agents to factor the cost of deliberation into the decision making process. I have developed an efficient model that creates this capability. The model uses as basic components anytime algorithms whose quality of results improves gradually as computation time increases. The main contribution of this work is a compilation process that extends the property of gradual improvement from the level of single algorithms to the level of complex systems.
In standard algorithms, the fixed quality of the output allows for composition to be implemented by a simple call-return mechanism. However, when algorithms have resource allocation as a degree of freedom, there arises the question of how to construct, for example, the optimal composition of two anytime algorithms, one of which feeds its output to the other. This scheduling problem is solved by an off-line compilation process and a run-time monitoring component that together generate a utility maximizing behavior. The crucial meta-level knowledge is kept in the anytime library in the form of conditional performance profiles. These profiles characterize the performance of each elementary anytime algorithm as a function of run-time and input quality. The compilation process therefore extends the principles of procedural abstraction and modularity to anytime computation. Its efficiency is significantly improved by using local compilation that works on a single program structure at a time. Local compilation is proved to yield global optimality for a large set of program structures.
Compilation produces contract algorithms which require the determination of the total run-time when activated. Some real-time domains require interruptible algorithms whose total run-time is unknown in advance. An important result of this work is a general method by which an interruptible algorithm can be constructed once a contract algorithm is compiled. Finally, the notion of gradual improvement of quality is extended to sensing and plan execution and the application of the model is demonstrated through a simulated robot navigation system. The result is a modular approach for developing real-time agents that act by performing anytime actions and make decisions using anytime computation.

Zilberstein, Shlomo; Russell, Stuart J

Anytime Sensing, Planning and Action: A Practical Model for Robot Control Conference

Proceedings of the 13th International Joint Conference on Artificial Intelligence (IJCAI), Chambery, France, 1993.

Abstract | Links | BibTeX

Scalable Algorithms for Probabilistic Reasoning

How can AI systems cope with uncertainty in large sequential decision problems, and how to leverage heuristic search and reachability analysis to solve complex probabilistic planning problems?

Show Related Publications

Miura, Shuwa; Zilberstein, Shlomo

Observer-Aware Planning with Implicit and Explicit Communication Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Bhatia, Abhinav; Nashed, Samer B.; Zilberstein, Shlomo

RL3: Boosting Meta Reinforcement Learning via RL inside RL2 Conference

NeurIPS Workshop on Generalized Planning (GenPlan), New Orleans, Louisiana, 2023.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Nashed, Samer B.; Goldman, Claudia V.; Zilberstein, Shlomo

Estimating Causal Responsibility for Explaining Autonomous Behavior Book Section

In: Calvaresi, Davide (Ed.): International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS), pp. 78–94, Springer, 2023.

Abstract | Links | BibTeX

@incollection{SZ:MNGZextraamas23,

title = {Estimating Causal Responsibility for Explaining Autonomous Behavior},

author = {Saaduddin Mahmud and Samer B. Nashed and Claudia V. Goldman and Shlomo Zilberstein},

editor = {Davide Calvaresi},

url = {http://rbr.cs.umass.edu/shlomo/papers/MNGZextraamas23.pdf},

doi = {10.1007/978-3-031-40878-6},

year  = {2023},

date = {2023-01-01},

booktitle = {International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS)},

pages = {78–94},

publisher = {Springer},

abstract = {There has been growing interest in causal explanations of stochastic, sequential decision-making systems. Structural causal models and causal reasoning offer several theoretical benefits when exact inference can be applied. Furthermore, users overwhelmingly prefer the resulting causal explanations over other state-of-the-art systems. In this work, we focus on one such method, MeanRESP, and its approximate versions that drastically reduce compute load and assign a responsibility score to each variable, which helps identify smaller sets of causes to be used as explanations. However, this method, and its approximate versions in particular, lack deeper theoretical analysis and broader empirical tests. To address these shortcomings, we provide three primary contributions. First, we offer several theoretical insights on the sample complexity and error rate of approximate MeanRESP. Second, we discuss several automated metrics for comparing explanations generated from approximate methods to those generated via exact methods. While we recognize the significance of user studies as the gold standard for evaluating explanations, our aim is to leverage the proposed metrics to systematically compare explanation-generation methods along important quantitative dimensions. Finally, we provide a more detailed discussion of MeanRESP and how its output under different definitions of responsibility compares to existing widely adopted methods that use Shapley values.},

keywords = {},

pubstate = {published},

tppubtype = {incollection}

}

Nashed, Samer; Zilberstein, Shlomo

A Survey of Opponent Modeling in Adversarial Domains Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 73, pp. 277–327, 2022.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo; Kamar, Ece

Avoiding Negative Side Effects of Autonomous Systems in the Open World Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 74, pp. 143–177, 2022.

Abstract | Links | BibTeX

@article{SZ:SZKjair22,

title = {Avoiding Negative Side Effects of Autonomous Systems in the Open World},

author = {Sandhya Saisubramanian and Shlomo Zilberstein and Ece Kamar},

url = {https://www.jair.org/index.php/jair/article/view/13581/26799},

doi = {10.1613/jair.1.13581},

year  = {2022},

date = {2022-01-01},

urldate = {2022-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {74},

pages = {143--177},

abstract = {Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning Via Introspective Perception Journal Article

In: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3218–3225, 2022.

Abstract | Links | BibTeX

Svegliato, Justin; Basich, Connor; Saisubramanian, Sandhya; Zilberstein, Shlomo

Metareasoning for Safe Decision Making in Autonomous Systems Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Philadelphia, Pennsylvania, 2022.

Abstract | Links | BibTeX

Miura, Shuwa; Wray, Kyle Hollins; Zilberstein, Shlomo

Heuristic Search for SSPs with Lexicographic Preferences over Multiple Costs Conference

Proceedings of the 15th Annual Symposium on Combinatorial Search (SOCS), Vienna, Austria, 2022.

Abstract | Links | BibTeX

Basich, Connor; Peterson, John; Zilberstein, Shlomo

Planning with Intermittent State Observability: Knowing When to Act Blind Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022.

Abstract | Links | BibTeX

Nashed, Samer B.; Svegliato, Justin; Bhatia, Abhinav; Russell, Stuart; Zilberstein, Shlomo

Selecting the Partial State Abstractions of MDPs: A Metareasoning Approach with Deep Reinforcement Learning Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022.

Abstract | Links | BibTeX

Nashed, Samer B; Svegliato, Justin; Brucato, Matteo; Basich, Connor; Grupen, Roderic A; Zilberstein, Shlomo

Solving Markov Decision Processes with Partial State Abstractions Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2021.

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Beach, Allyson; Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Improving Competence via Iterative State Space Refinement Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021.

Abstract | Links | BibTeX

Parr, Shane; Khatri, Ishan; Svegliato, Justin; Zilberstein, Shlomo

Agent-Aware State Estimation in Autonomous Vehicles Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021.

Abstract | Links | BibTeX

Basich, Connor; Wang, Daniel; Russino, Joseph; Chien, Steve; Zilberstein, Shlomo

A Sampling-Based Optimization Approach to Handling Environmental Uncertainty for a Planetary Lander Conference

ICAPS Workshop on Planning and Robotics (PlanRob), Guangzhou, China, 2021.

Abstract | BibTeX

Miura, Shuwa; Cohen, Andrew L; Zilberstein, Shlomo

Maximizing Legibility in Stochastic Environments Conference

Proceedings of the 30th IEEE International Conference on Robot & Human Interactive Communication, (RO-MAN), Vancouver, BC, Canada, 2021.

Abstract | Links | BibTeX

Miura, Shuwa; Zilberstein, Shlomo

A Unifying Framework for Observer-Aware Planning and its Complexity Conference

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (UAI), Virtual Event, 2021.

Abstract | Links | BibTeX

Pineda, Luis; Zilberstein, Shlomo

Soft Labeling in Stochastic Shortest Path Problems Conference

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Montreal, Quebec, CA, 2019.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Wray, Kyle Hollins; Pineda, Luis Enrique; Zilberstein, Shlomo

Planning in Stochastic Environments with Goal Uncertainty Conference

ICAPS Workshop on Planning and Robotics (PlanRob), Berkeley, CA, 2019.

Abstract | BibTeX

Saisubramanian, Sandhya; Basich, Connor; Zilberstein, Shlomo; Goldman, Claudia V

The Value of Incorporating Social Preferences in Dynamic Ridesharing Conference

ICAPS Workshop on Scheduling and Planning Applications (SPARK), Berkeley, CA, 2019.

Abstract | BibTeX

Pineda, Luis Enrique; Zilberstein, Shlomo

Probabilistic Planning with Reduced Models Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 65, pp. 271–306, 2019.

Abstract | Links | BibTeX

@article{SZ:PZjair19,

title = {Probabilistic Planning with Reduced Models},

author = {Luis Enrique Pineda and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/PZjair19.pdf},

doi = {10.1613/jair.1.11569},

year  = {2019},

date = {2019-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {65},

pages = {271--306},

abstract = {Reduced models are simplified versions of a given domain, designed to accelerate the planning process. Interest in reduced models has grown since the surprising success of determinization in the first international probabilistic planning competition, leading to the development of several enhanced determinization techniques. To address the drawbacks of previous determinization methods, we introduce a family of reduced models in which probabilistic outcomes are classified as one of two types: primary and exceptional. In each model that belongs to this family of reductions, primary outcomes can occur an unbounded number of times per trajectory, while exceptions can occur at most a finite number of times, specified by a parameter. Distinct reduced models are characterized by two parameters: the maximum number of primary outcomes per action, and the maximum number of occurrences of exceptions per trajectory. This family of reductions generalizes the well-known most-likely-outcome determinization approach, which includes one primary outcome per action and zero exceptional outcomes per plan. We present a framework to determine the benefits of planning with reduced models, and develop a continual planning approach that handles situations where the number of exceptions exceeds the specified bound during plan execution. Using this framework, we compare the performance of various reduced models and consider the challenge of generating good ones automatically. We show that each one of the dimensions--allowing more than one primary outcome or planning for some limited number of exceptions--could improve performance relative to standard determinization. The results place previous work on determinization in a broader context and lay the foundation for a systematic exploration of the space of model reductions.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Saisubramanian, Sandhya; Wray, Kyle Hollins; Pineda, Luis Enrique; Zilberstein, Shlomo

Planning in Stochastic Environments with Goal Uncertainty Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019.

Abstract | Links | BibTeX

Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan J; Biswas, Joydeep; Zilberstein, Shlomo

Belief Space Metareasoning for Exception Recovery Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Adaptive Outcome Selection for Planning With Reduced Models Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo; Shenoy, Prashant J

Planning Using a Portfolio of Reduced Models Conference

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Stockholm, Sweden, 2018.

Abstract | Links | BibTeX

Srivastava, Siddharth; Desai, Nishant; Freedman, Richard G; Zilberstein, Shlomo

An Anytime Algorithm for Task and Motion MDPs Conference

ICAPS Workshop on Planning and Robotics (PlanRob), Delft, The Netherlands, 2018.

Abstract | Links | BibTeX

Pineda, Luis Enrique; Wray, Kyle Hollins; Zilberstein, Shlomo

Fast SSP Solvers Using Short-Sighted Labeling Conference

Proceedings of the 31st Conference on Artificial Intelligence (AAAI), San Francisco, California, 2017.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo; Mouaddib, Abdel-Illah

Multi-Objective MDPs with Conditional Lexicographic Reward Preferences Conference

Proceedings of the 29th Conference on Artificial Intelligence (AAAI), Austin, Texas, 2015.

Abstract | Links | BibTeX

Pineda, Luis; Wray, Kyle Hollins; Zilberstein, Shlomo

Revisiting Multi-Objective MDPs with Relaxed Lexicographic Preferences Conference

AAAI Fall Symposium on Sequential Decision Making for Intelligent Agents (SDMIA), Arlington, Virginia, 2015.

Abstract | Links | BibTeX

Pineda, Luis; Zilberstein, Shlomo

Planning Under Uncertainty Using Reduced Models: Revisiting Determinization Conference

Proceedings of the 24th International Conference on Automated Planning and Scheduling (ICAPS), Portsmouth, New Hampshire, 2014.

Abstract | Links | BibTeX

Pineda, Luis; Lu, Yi; Zilberstein, Shlomo; Goldman, Claudia V

Fault-Tolerant Planning Under Uncertainty Conference

Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Robust Approximate Bilinear Programming for Value Function Approximation Journal Article

In: Journal of Machine Learning Research (JMLR), vol. 12, pp. 3027–3063, 2011.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Linear Dynamic Programs for Resource Management Conference

Proceedings of the 25th Conference on Artificial Intelligence (AAAI), San Francisco, California, 2011.

Abstract | Links | BibTeX

Wu, Xiaojian; Kumar, Akshat; Zilberstein, Shlomo

Influence Diagrams with Memory States: Representation and Algorithms Conference

Proceedings of the 2nd International Conference on Algorithmic Decision Theory (ADT), Piscataway, New Jersey, 2011.

Abstract | Links | BibTeX

Petrik, Marek; Taylor, Gavin; Parr, Ron; Zilberstein, Shlomo

Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes Conference

Proceedings of the 27th International Conference on Machine Learning (ICML), Haifa, Israel, 2010.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Constraint Relaxation in Approximate Linear Programs Conference

Proceedings of the 26th International Conference on Machine Learning (ICML), Montreal, Canada, 2009.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Robust Value Function Approximation Using Bilinear Programming Conference

Proceedings of the 23rd Neural Information Processing Systems Conference (NIPS), Vancouver, British Columbia, Canada, 2009.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Average-Reward Decentralized Markov Decision Processes Conference

Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, 2007.

Abstract | Links | BibTeX

Feng, Zhengzhu; Hansen, Eric A; Zilberstein, Shlomo

Symbolic Generalization for On-line Planning Conference

Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence (UAI), Acapulco, Mexico, 2003.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

LAO*: A Heuristic Search Algorithm that Finds Solutions with Loops Journal Article

In: Artificial Intelligence (AIJ), vol. 129, no. 1-2, pp. 35–62, 2001.

Abstract | Links | BibTeX

Hansen, Eric A; Zilberstein, Shlomo

Heuristic Search in Cyclic AND/OR Graphs Conference

Proceedings of the 15th National Conference on Artificial Intelligence (AAAI), Madison, Wisconsin, 1998.

Abstract | Links | BibTeX

Hansen, Eric A; Barto, Andrew G; Zilberstein, Shlomo

Reinforcement Learning for Mixed Open-loop and Closed-loop Control Conference

Proceedings of the 9th Neural Information Processing Systems Conference (NIPS), Denver, Colorado, 1996.

Abstract | Links | BibTeX

Belief-Space Planning and POMDPs

How to select actions based on partial and imprecise information about the environment, and how to design efficient algorithms to do planning in belief space?

Show Related Publications

Miura, Shuwa; Zilberstein, Shlomo

Observer-Aware Planning with Implicit and Explicit Communication Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Vazquez-Chanlatte, Marcell; Witwicki, Stefan; Zilberstein, Shlomo

Explaining the Behavior of POMDP-based Agents Through the Impact of Counterfactual Information Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Basich, Connor; Peterson, John; Zilberstein, Shlomo

Planning with Intermittent State Observability: Knowing When to Act Blind Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022.

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Beach, Allyson; Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Improving Competence via Iterative State Space Refinement Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021.

Abstract | Links | BibTeX

Parr, Shane; Khatri, Ishan; Svegliato, Justin; Zilberstein, Shlomo

Agent-Aware State Estimation in Autonomous Vehicles Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021.

Abstract | Links | BibTeX

Miura, Shuwa; Cohen, Andrew L; Zilberstein, Shlomo

Maximizing Legibility in Stochastic Environments Conference

Proceedings of the 30th IEEE International Conference on Robot & Human Interactive Communication, (RO-MAN), Vancouver, BC, Canada, 2021.

Abstract | Links | BibTeX

Miura, Shuwa; Zilberstein, Shlomo

A Unifying Framework for Observer-Aware Planning and its Complexity Conference

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (UAI), Virtual Event, 2021.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Generalized Controllers in POMDP Decision-Making Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Montreal, Quebec, CA, 2019.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Wray, Kyle Hollins; Pineda, Luis Enrique; Zilberstein, Shlomo

Planning in Stochastic Environments with Goal Uncertainty Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Approximating Reachable Belief Points in POMDPs with Applications to Robotic Navigation and Localization Conference

ICAPS Workshop on Planning and Robotics (PlanRob), Pittsburgh, Pennsylvania, 2017.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Approximating reachable belief points in POMDPs Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 2017.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

A POMDP Formulation of Proactive Learning Conference

Proceedings of the 30th Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, 2016.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

History-Based Controller Design and Optimization for Partially Observable MDPs Conference

Proceedings of the 25th International Conference on Automated Planning and Scheduling (ICAPS), Jerusalem, Israel, 2015.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Multi-Objective POMDPs with Lexicographic Reward Preferences Conference

Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI), Buenos Aires, Argentina, 2015.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

A Parallel Point-Based POMDP Algorithm Leveraging GPUs Conference

AAAI Fall Symposium on Sequential Decision Making for Intelligent Agents (SDMIA), Arlington, Virginia, 2015.

Abstract | Links | BibTeX

Amato, Christopher; Bernstein, Daniel S; Zilberstein, Shlomo

Optimizing Fixed-Size Stochastic Controllers for POMDPs and Decentralized POMDPs Journal Article

In: Autonomous Agents and Multi-Agent Systems (JAAMAS), vol. 21, no. 3, pp. 293–320, 2010.

Abstract | Links | BibTeX

@article{SZ:ABZjaamas10,

title = {Optimizing Fixed-Size Stochastic Controllers for POMDPs and Decentralized POMDPs},

author = {Christopher Amato and Daniel S Bernstein and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/ABZjaamas10.pdf},

doi = {10.1007/s10458-009-9103-z},

year  = {2010},

date = {2010-01-01},

journal = {Autonomous Agents and Multi-Agent Systems (JAAMAS)},

volume = {21},

number = {3},

pages = {293--320},

abstract = {Coordination of distributed agents is required for problems arising in many areas, including multi-robot systems, networking and e-commerce. As a formal framework for such problems, we use the decentralized partially observable Markov decision process (DEC-POMDP). Though much work has been done on optimal dynamic programming algorithms for the single-agent version of the problem, optimal algorithms for the multiagent case have been elusive. The main contribution of this paper is an optimal policy iteration algorithm for solving DEC-POMDPs. The algorithm uses stochastic finite-state controllers to represent policies. The solution can include a correlation device, which allows agents to correlate their actions without communicating. This approach alternates between expanding the controller and performing value-preserving transformations, which modify the controller without sacrificing value. We present two Efficient value-preserving transformations: one can reduce the size of the controller and the other can improve its value while keeping the size fixed. Empirical results demonstrate the usefulness of value-preserving transformations in increasing value while keeping controller size to a minimum. To broaden the applicability of the approach, we also present a heuristic version of the policy iteration algorithm, which sacrifices convergence to optimality. This algorithm further reduces the size of the controllers at each step by assuming that probability distributions over the other agents' actions are known. While this assumption may not hold in general, it helps produce higher quality solutions in our test problems.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Amato, Christopher; Bonet, Blai; Zilberstein, Shlomo

Finite-State Controllers Based on Mealy Machines for Centralized and Decentralized POMDPs Conference

Proceedings of the 24th Conference on Artificial Intelligence (AAAI), Atlanta, Georgia, 2010.

Abstract | Links | BibTeX

Amato, Christopher; Bernstein, Daniel S; Zilberstein, Shlomo

Solving POMDPs Using Quadratically Constrained Linear Programs Conference

Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, 2007.

Abstract | Links | BibTeX

Feng, Zhengzhu; Zilberstein, Shlomo

Efficient Maximization in Solving POMDPs Conference

Proceedings of the 20th National Conference on Artificial Intelligence (AAAI), Pittsburgh, Pennsylvania, 2005.

Abstract | Links | BibTeX

Feng, Zhengzhu; Zilberstein, Shlomo

Region-Based Incremental Pruning for POMDPs Conference

Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (UAI), Banff, Canada, 2004.

Abstract | Links | BibTeX

Multiagent Planning and DEC-POMDPs

How can a group of intelligent agents coordinate their decisions in spite of stochasticity and limited information, and how to extend decision-theoretic models to such complex multiagent settings?

Show Related Publications

65 entries « ‹ 1 of 2 › »

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 37th International FLAIRS Conference, Miramar Beach, Florida, 2024.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Nashed, Samer B.; Goldman, Claudia V.; Zilberstein, Shlomo

Estimating Causal Responsibility for Explaining Autonomous Behavior Book Section

In: Calvaresi, Davide (Ed.): International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS), pp. 78–94, Springer, 2023.

Abstract | Links | BibTeX

@incollection{SZ:MNGZextraamas23,

title = {Estimating Causal Responsibility for Explaining Autonomous Behavior},

author = {Saaduddin Mahmud and Samer B. Nashed and Claudia V. Goldman and Shlomo Zilberstein},

editor = {Davide Calvaresi},

url = {http://rbr.cs.umass.edu/shlomo/papers/MNGZextraamas23.pdf},

doi = {10.1007/978-3-031-40878-6},

year  = {2023},

date = {2023-01-01},

booktitle = {International Workshop on Explainable and Transparent AI and Multi-Agent Systems (EXTRAAMAS)},

pages = {78–94},

publisher = {Springer},

abstract = {There has been growing interest in causal explanations of stochastic, sequential decision-making systems. Structural causal models and causal reasoning offer several theoretical benefits when exact inference can be applied. Furthermore, users overwhelmingly prefer the resulting causal explanations over other state-of-the-art systems. In this work, we focus on one such method, MeanRESP, and its approximate versions that drastically reduce compute load and assign a responsibility score to each variable, which helps identify smaller sets of causes to be used as explanations. However, this method, and its approximate versions in particular, lack deeper theoretical analysis and broader empirical tests. To address these shortcomings, we provide three primary contributions. First, we offer several theoretical insights on the sample complexity and error rate of approximate MeanRESP. Second, we discuss several automated metrics for comparing explanations generated from approximate methods to those generated via exact methods. While we recognize the significance of user studies as the gold standard for evaluating explanations, our aim is to leverage the proposed metrics to systematically compare explanation-generation methods along important quantitative dimensions. Finally, we provide a more detailed discussion of MeanRESP and how its output under different definitions of responsibility compares to existing widely adopted methods that use Shapley values.},

keywords = {},

pubstate = {published},

tppubtype = {incollection}

}

Parr, Shane; Khatri, Ishan; Svegliato, Justin; Zilberstein, Shlomo

Agent-Aware State Estimation in Autonomous Vehicles Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Jennings, Nicholas R

Multi-Agent Planning with High-Level Human Guidance Conference

Proceedings of Principles and Practice of Multi-Agent Systems (PRIMA), 2020.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Jennings, Nicholas R

Stochastic Multi-agent Planning with Partial State Models Conference

Proceedings of the First International Conference on Distributed Artificial Intelligence (DAI), Beijing, China, 2019.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Privacy-Preserving Policy Iteration for Decentralized POMDPs Conference

Proceedings of the 32nd Conference on Artificial Intelligence (AAAI), New Orleans, Louisiana, 2018.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Kumar, Akshat; Zilberstein, Shlomo

Integrated Cooperation and Competition in Multi-Agent Decision-Making Conference

Proceedings of the 32nd Conference on Artificial Intelligence (AAAI), New Orleans, Louisiana, 2018.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Multi-Agent Planning with Baseline Regret Minimization Conference

Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), 2017.

Abstract | Links | BibTeX

Kumar, Akshat; Mostafa, Hala; Zilberstein, Shlomo

Dual Formulations for Optimizing Dec-POMDP Controllers Conference

Proceedings of the 26th International Conference on Automated Planning and Scheduling (ICAPS), London, UK, 2016.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo; Toussaint, Marc

Probabilistic Inference Techniques for Scalable Multiagent Decision Making Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 53, pp. 223–270, 2015.

Abstract | Links | BibTeX

Nguyen, Duc Thien; Yeoh, William; Lau, Hoong Chuin; Zilberstein, Shlomo; Zhang, Chongjie

Decentralized Multi-Agent Reinforcement Learning in Average-Reward Dynamic DCOPs Conference

Proceedings of the 28th Conference on Artificial Intelligence (AAAI), Quebec City, Canada, 2014.

Abstract | Links | BibTeX

Brafman, Ronen I; Shani, Guy; Zilberstein, Shlomo

Qualitative Planning under Partial Observability in Multi-Agent Domains Conference

Proceedings of the 27th Conference on Artificial Intelligence (AAAI), Bellevue, Washington, 2013.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Jennings, Nicholas R

Monte-Carlo Expectation Maximization for Decentralized POMDPs Conference

Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.

Abstract | Links | BibTeX

Yeoh, William; Kumar, Akshat; Zilberstein, Shlomo

Automated Generation of Interaction Graphs for Value-Factored Dec-POMDPs Conference

Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.

Abstract | Links | BibTeX

Durfee, Edmund; Zilberstein, Shlomo

Multiagent Planning, Control, and Execution Book Section

In: Weiss, G (Ed.): Multiagent Systems, Second Edition, pp. 485–546, MIT Press, Cambridge, MA, USA, 2013.

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Online Planning for Multi-Agent Systems with Bounded Communication Journal Article

In: Artificial Intelligence (AIJ), vol. 175, no. 2, pp. 487–511, 2011.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Message-Passing Algorithms for Large Structured Decentralized POMDPs Conference

Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Taipei, Taiwan, 2011.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo; Toussaint, Marc

Scalable Multiagent Planning Using Probabilistic Inference Conference

Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI), Barcelona, Spain, 2011.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Online Planning for Ad Hoc Autonomous Agent Teams Conference

Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI), Barcelona, Spain, 2011.

Abstract | Links | BibTeX

Amato, Christopher; Bernstein, Daniel S; Zilberstein, Shlomo

Optimizing Fixed-Size Stochastic Controllers for POMDPs and Decentralized POMDPs Journal Article

In: Autonomous Agents and Multi-Agent Systems (JAAMAS), vol. 21, no. 3, pp. 293–320, 2010.

Abstract | Links | BibTeX

@article{SZ:ABZjaamas10,

title = {Optimizing Fixed-Size Stochastic Controllers for POMDPs and Decentralized POMDPs},

author = {Christopher Amato and Daniel S Bernstein and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/ABZjaamas10.pdf},

doi = {10.1007/s10458-009-9103-z},

year  = {2010},

date = {2010-01-01},

journal = {Autonomous Agents and Multi-Agent Systems (JAAMAS)},

volume = {21},

number = {3},

pages = {293--320},

abstract = {Coordination of distributed agents is required for problems arising in many areas, including multi-robot systems, networking and e-commerce. As a formal framework for such problems, we use the decentralized partially observable Markov decision process (DEC-POMDP). Though much work has been done on optimal dynamic programming algorithms for the single-agent version of the problem, optimal algorithms for the multiagent case have been elusive. The main contribution of this paper is an optimal policy iteration algorithm for solving DEC-POMDPs. The algorithm uses stochastic finite-state controllers to represent policies. The solution can include a correlation device, which allows agents to correlate their actions without communicating. This approach alternates between expanding the controller and performing value-preserving transformations, which modify the controller without sacrificing value. We present two Efficient value-preserving transformations: one can reduce the size of the controller and the other can improve its value while keeping the size fixed. Empirical results demonstrate the usefulness of value-preserving transformations in increasing value while keeping controller size to a minimum. To broaden the applicability of the approach, we also present a heuristic version of the policy iteration algorithm, which sacrifices convergence to optimality. This algorithm further reduces the size of the controllers at each step by assuming that probability distributions over the other agents' actions are known. While this assumption may not hold in general, it helps produce higher quality solutions in our test problems.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Kumar, Akshat; Zilberstein, Shlomo

Point-Based Backup for Decentralized POMDPs: Complexity and New Algorithms Conference

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Toronto, Canada, 2010.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Point-Based Policy Generation for Decentralized POMDPs Conference

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Toronto, Canada, 2010.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Anytime Planning for Decentralized POMDPs using Expectation Maximization Conference

Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, California, 2010.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Rollout Sampling Policy Iteration for Decentralized POMDPs Conference

Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI), Catalina Island, California, 2010.

Abstract | Links | BibTeX

Amato, Christopher; Bonet, Blai; Zilberstein, Shlomo

Finite-State Controllers Based on Mealy Machines for Centralized and Decentralized POMDPs Conference

Proceedings of the 24th Conference on Artificial Intelligence (AAAI), Atlanta, Georgia, 2010.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Trial-Based Dynamic Programming for Multi-Agent Planning Conference

Proceedings of the 24th Conference on Artificial Intelligence (AAAI), Atlanta, Georgia, 2010.

Abstract | Links | BibTeX

Bernstein, Daniel S; Amato, Christopher; Hansen, Eric A; Zilberstein, Shlomo

Policy Iteration for Decentralized Control of Markov Decision Processes Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 34, pp. 89–132, 2009.

Abstract | Links | BibTeX

@article{SZ:BAHZjair09,

title = {Policy Iteration for Decentralized Control of Markov Decision Processes},

author = {Daniel S Bernstein and Christopher Amato and Eric A Hansen and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/BAHZjair09.pdf},

doi = {10.1613/jair.2667},

year  = {2009},

date = {2009-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {34},

pages = {89--132},

abstract = {Coordination of distributed agents is required for problems arising in many areas, including multi-robot systems, networking and e-commerce. As a formal framework for such problems, we use the decentralized partially observable Markov decision process (DEC-POMDP). Though much work has been done on optimal dynamic programming algorithms for the single-agent version of the problem, optimal algorithms for the multiagent case have been elusive. The main contribution of this paper is an optimal policy iteration algorithm for solving DEC-POMDPs. The algorithm uses stochastic finite-state controllers to represent policies. The solution can include a correlation device, which allows agents to correlate their actions without communicating. This approach alternates between expanding the controller and performing value-preserving transformations, which modify the controller without sacrificing value. We present two Efficient value-preserving transformations: one can reduce the size of the controller and the other can improve its value while keeping the size fixed. Empirical results demonstrate the usefulness of value-preserving transformations in increasing value while keeping controller size to a minimum. To broaden the applicability of the approach, we also present a heuristic version of the policy iteration algorithm, which sacrifices convergence to optimality. This algorithm further reduces the size of the controllers at each step by assuming that probability distributions over the other agents' actions are known. While this assumption may not hold in general, it helps produce higher quality solutions in our test problems.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Petrik, Marek; Zilberstein, Shlomo

A Bilinear Programming Approach for Multiagent Planning Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 35, pp. 235–274, 2009.

Abstract | Links | BibTeX

Becker, Raphen; Carlin, Alan; Lesser, Victor; Zilberstein, Shlomo

Analyzing Myopic Approaches for Multi-Agent Communication Journal Article

In: Computational Intelligence, vol. 25, no. 1, pp. 31–50, 2009.

Abstract | Links | BibTeX

Amato, Christopher; Zilberstein, Shlomo

Achieving Goals in Decentralized POMDPs Conference

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Budapest, Hungary, 2009.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions Conference

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Budapest, Hungary, 2009.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Dynamic Programming Approximations for Partially Observable Stochastic Games Conference

Proceedings of the 22nd International FLAIRS Conference, Sanibel Island, Florida, 2009.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Event-Detecting Multi-Agent MDPs: Complexity and Constant-Factor Approximation Conference

Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI), Pasadena, California, 2009.

Abstract | Links | BibTeX

Amato, Christopher; Dibangoye, Jilles Steeve; Zilberstein, Shlomo

Incremental Policy Generation for Finite-Horizon DEC-POMDPs Conference

Proceedings of the 19th International Conference on Automated Planning and Scheduling (ICAPS), Thessaloniki, Greece, 2009.

Abstract | Links | BibTeX

Wu, Feng; Zilberstein, Shlomo; Chen, Xiaoping

Multi-Agent Online Planning with Communication Conference

Proceedings of the 19th International Conference on Automated Planning and Scheduling (ICAPS), Thessaloniki, Greece, 2009.

Abstract | Links | BibTeX

Allen, Martin; Zilberstein, Shlomo

Complexity of Decentralized Control: Special Cases Conference

Proceedings of the 23rd Neural Information Processing Systems Conference (NIPS), Vancouver, British Columbia, Canada, 2009.

Abstract | Links | BibTeX

Goldman, Claudia V; Zilberstein, Shlomo

Communication-Based Decomposition Mechanisms for Decentralized MDPs Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 32, pp. 169–202, 2008.

Abstract | Links | BibTeX

@article{SZ:GZjair08,

title = {Communication-Based Decomposition Mechanisms for Decentralized MDPs},

author = {Claudia V Goldman and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/GZjair08.pdf},

doi = {10.1613/jair.2466},

year  = {2008},

date = {2008-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {32},

pages = {169--202},

abstract = {Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing, multi-robot coordination and information gathering scenarios can be formalized using this framework. However, finding the optimal solution in the general case is hard, limiting the applicability of recently developed algorithms. This paper provides a practical approach for solving decentralized control problems when communication among the decision makers is possible, but costly. We develop the notion of communication-based mechanism that allows us to decompose a decentralized MDP into multiple single-agent problems. In this framework, referred to as decentralized semi-Markov decision process with direct communication (Dec-SMDP-Com), agents operate separately between communications. We show that finding an optimal mechanism is equivalent to solving optimally a Dec-SMDP-Com. We also provide a heuristic search algorithm that converges on the optimal decomposition. Restricting the decomposition to some specific types of local behaviors reduces significantly the complexity of planning. In particular, we present a polynomial-time algorithm for the case in which individual agents perform goal-oriented behaviors between communications. The paper concludes with an additional tractable algorithm that enables the introduction of human knowledge, thereby reducing the overall problem to finding the best time to communicate. Empirical results show that these approaches provide good approximate solutions.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Seuken, Sven; Zilberstein, Shlomo

Formal Models and Algorithms for Decentralized Decision Making under Uncertainty Journal Article

In: Autonomous Agents and Multi-Agent Systems (JAAMAS), vol. 17, no. 2, pp. 190–250, 2008.

Abstract | Links | BibTeX

@article{SZ:SZjaamas08,

title = {Formal Models and Algorithms for Decentralized Decision Making under Uncertainty},

author = {Sven Seuken and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/SZjaamas08.pdf},

doi = {10.1007/s10458-007-9026-5},

year  = {2008},

date = {2008-01-01},

journal = {Autonomous Agents and Multi-Agent Systems (JAAMAS)},

volume = {17},

number = {2},

pages = {190--250},

abstract = {Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing, multi-robot coordination and information gathering scenarios can be formalized using this framework. However, finding the optimal solution in the general case is hard, limiting the applicability of recently developed algorithms. This paper provides a practical approach for solving decentralized control problems when communication among the decision makers is possible, but costly. We develop the notion of communication-based mechanism that allows us to decompose a decentralized MDP into multiple single-agent problems. In this framework, referred to as decentralized semi-Markov decision process with direct communication (Dec-SMDP-Com), agents operate separately between communications. We show that finding an optimal mechanism is equivalent to solving optimally a Dec-SMDP-Com. We also provide a heuristic search algorithm that converges on the optimal decomposition. Restricting the decomposition to some specific types of local behaviors reduces significantly the complexity of planning. In particular, we present a polynomial-time algorithm for the case in which individual agents perform goal-oriented behaviors between communications. The paper concludes with an additional tractable algorithm that enables the introduction of human knowledge, thereby reducing the overall problem to finding the best time to communicate. Empirical results show that these approaches provide good approximate solutions.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Petrik, Marek; Zilberstein, Shlomo

A Successive Approximation Algorithm for Coordination Problems Conference

Proceedings of the 10th International Symposium on Artificial Intelligence and Mathematics (ISAIM), Ft. Lauderdale, Florida, 2008.

Abstract | Links | BibTeX

Carlin, Alan; Zilberstein, Shlomo

Value-Based Observation Compression for DEC-POMDPs Conference

Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Estoril, Portugal, 2008.

Abstract | Links | BibTeX

Carlin, Alan; Zilberstein, Shlomo

Observation Compression in DEC-POMDP Policy Trees Conference

AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), Estoril, Portugal, 2008, (Best Paper Award).

Abstract | Links | BibTeX

Amato, Christopher; Zilberstein, Shlomo

What's Worth Memorizing: Attribute-based Planning for DEC-POMDPs Conference

ICAPS Workshop on Multiagent Planning, Sydney, Australia, 2008.

Abstract | Links | BibTeX

Goldman, Claudia V; Allen, Martin; Zilberstein, Shlomo

Learning to Communicate in a Decentralized Environment Journal Article

In: Autonomous Agents and Multi-Agent Systems (JAAMAS), vol. 15, no. 1, pp. 47–90, 2007.

Abstract | Links | BibTeX

Seuken, Sven; Zilberstein, Shlomo

Memory-Bounded Dynamic Programming for DEC-POMDPs Conference

Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, 2007.

Abstract | Links | BibTeX

Amato, Christopher; Bernstein, Daniel S; Zilberstein, Shlomo

Optimizing Memory-Bounded Controllers for Decentralized POMDPs Conference

Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI), Vancouver, British Columbia, 2007.

Abstract | Links | BibTeX

Seuken, Sven; Zilberstein, Shlomo

Improved Memory-Bounded Dynamic Programming for Decentralized POMDPs Conference

Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI), Vancouver, British Columbia, 2007.

Abstract | Links | BibTeX

Allen, Martin; Zilberstein, Shlomo

Agent Influence as a Predictor of Difficulty for Decentralized Problem-Solving Conference

Proceedings of the 22nd Conference on Artificial Intelligence (AAAI), Vancouver, British Columbia, 2007.

Abstract | Links | BibTeX

Petrik, Marek; Zilberstein, Shlomo

Anytime Coordination Using Separable Bilinear Programs Conference

Proceedings of the 22nd Conference on Artificial Intelligence (AAAI), Vancouver, British Columbia, 2007.

Abstract | Links | BibTeX

Szer, Daniel; Charpillet, Francois; Zilberstein, Shlomo

MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs Conference

Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI), Edinburgh, Scotland, 2005.

Abstract | Links | BibTeX

65 entries « ‹ 1 of 2 › »

Generalized Planning

How can agents create generalized plans, which are algorithm-like plans that include loops and branches, can handle unknown quantities of objects, and work for large classes of problem instances?

Show Related Publications

Bhatia, Abhinav; Nashed, Samer B.; Zilberstein, Shlomo

RL3: Boosting Meta Reinforcement Learning via RL inside RL2 Conference

NeurIPS Workshop on Generalized Planning (GenPlan), New Orleans, Louisiana, 2023.

Abstract | Links | BibTeX

Srivastava, Siddharth; Zilberstein, Shlomo; Gupta, Abhishek; Abbeel, Pieter; Russell, Stuart J

Tractability of Planning with Loops Conference

Proceedings of the 29th Conference on Artificial Intelligence (AAAI), Austin, Texas, 2015.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Applicability Conditions for Plans with Loops: Computability Results and Algorithms Journal Article

In: Artificial Intelligence (AIJ), vol. 191, pp. 1–19, 2012.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

A New Representation and Associated Algorithms for Generalized Planning Journal Article

In: Artificial Intelligence (AIJ), vol. 175, no. 2, pp. 615–647, 2011.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo; Zhang, Tianjiao

Directed Search for Generalized Plans Using Classical Planners Conference

Proceedings of the 21st International Conference on Automated Planning and Scheduling (ICAPS), Freiburg, Germany, 2011.

Abstract | Links | BibTeX

Srivastava, Siddharth; Zilberstein, Shlomo; Immerman, Neil; Geffner, Hector

Qualitative Numeric Planning Conference

Proceedings of the 25th Conference on Artificial Intelligence (AAAI), San Francisco, California, 2011.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Termination and Correctness Analysis of Cyclic Control Conference

Proceedings of the 25th Conference on Artificial Intelligence (AAAI Nectar Track), San Francisco, California, 2011.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Computing Applicability Conditions for Plans with Loops Conference

Proceedings of the 20th International Conference on Automated Planning and Scheduling (ICAPS), Toronto, Canada, 2010, (Best Paper Award).

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Merging Example Plans into Generalized Plans for Non-deterministic Environments Conference

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Toronto, Canada, 2010.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Abstract Planning with Unknown Object Quantities and Properties Conference

Proceedings of the 8th Symposium on Abstraction, Reformulation, and Approximation (SARA), Lake Arrowhead, California, 2009.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Using Abstraction for Generalized Planning Conference

Proceedings of the 10th International Symposium on Artificial Intelligence and Mathematics (ISAIM), Ft. Lauderdale, Florida, 2008.

Abstract | Links | BibTeX

Srivastava, Siddharth; Immerman, Neil; Zilberstein, Shlomo

Learning Generalized Plans Using Abstract Counting Conference

Proceedings of the 23rd Conference on Artificial Intelligence (AAAI), Chicago, Illinois, 2008.

Abstract | Links | BibTeX

Introspective Autonomy

How can autonomous AI systems acquire a model of their own capabilities and limitations, seek human assistance when needed, and become progressively independent?

Show Related Publications

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Belief State Determination for Real-Time Decision-Making Miscellaneous

2024, (US Patent 11,921,506).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Objective-Based Reasoning in Autonomous Vehicle Decision-Making Miscellaneous

2024, (US Patent 11,899,454).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Shared Autonomous Vehicle Operational Management Miscellaneous

2024, (US Patent 11,874,120).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo; Bentahar, Omar; Jamgochian, Arec

Explainability of Autonomous Vehicle Decision Making Miscellaneous

2023, (US Patent 11,714,971).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Autonomous Vehicle Operation with Explicit Occlusion Reasoning Miscellaneous

2023, (US Patent 11,702,070).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Risk Aware Executor with Action Set Recommendations Miscellaneous

2023, (US Patent 11,635,758).

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan; Biswas, Joydeep; Zilberstein, Shlomo

Competence-Aware Systems Journal Article

In: Artificial Intelligence (AIJ), iss. 316, pp. 103844, 2023.

Abstract | Links | BibTeX

@article{SZ:BSWWBZaij23,

title = {Competence-Aware Systems},

author = {Connor Basich and Justin Svegliato and Kyle Hollins Wray and Stefan Witwicki and Joydeep Biswas and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/BSWWBZaij23.pdf},

doi = {10.1016/j.artint.2022.103844},

year  = {2023},

date = {2023-03-16},

urldate = {2023-03-16},

journal = {Artificial Intelligence (AIJ)},

issue = {316},

pages = {103844},

abstract = {Building autonomous systems for deployment in the open world has been a longstanding objective in both artificial intelligence and robotics. The open world, however, presents challenges that question some of the assumptions often made in contemporary AI models. Autonomous systems that operate in the open world face complex, non-stationary environments wherein enumerating all situations the system may face over the course of its deployment is intractable. Nevertheless, these systems are expected to operate safely and reliably for extended durations. Consequently, AI systems often rely on some degree of human assistance to mitigate risks while completing their tasks, and are hence better treated as semi-autonomous systems. In order to reduce unnecessary reliance on humans and optimize autonomy, we propose a novel introspective planning model—competence-aware systems (CAS)—that enables a semi-autonomous system to reason about its own competence and allowed level of autonomy by leveraging human feedback or assistance. A CAS learns to adjust its level of autonomy based on experience and interactions with a human authority so as to reduce improper reliance on the human and optimize the degree of autonomy it employs in any given circumstance. To handle situations in which the initial CAS model has insufficient state information to properly discriminate feedback received from humans, we introduce a methodology called iterative state space refinement that gradually increases the granularity of the state space online. The approach exploits information that exists in the standard CAS model and requires no additional input from the human. The result is an agent that can more confidently predict the correct feedback from the human authority in each level of autonomy, enabling it learn its competence in a larger portion of the state space.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Learning Safety and Human-Centered Constraints in Autonomous Vehicles Miscellaneous

2023, (US Patent 11,613,269).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Bentahar, Omar; Vagadia, Astha; Cesafsky, Laura; Jamgochian, Arec; Witwicki, Stefan; Baig, Najamuddin Mirza; Gyorfi, Julius S; Zilberstein, Shlomo; Sharma, Sparsh

Explainability of Autonomous Vehicle Decision Making Miscellaneous

2023, (US Patent 11,577,746).

Abstract | Links | BibTeX

Wray, Kyle; Witwicki, Stefan; Zilberstein, Shlomo; Pedersen, Liam

Autonomous Vehicle Operational Management Including Operating a Partially Observable Markov Decision Process Model Instance Miscellaneous

2022, (US Patent 11,500,380).

Abstract | Links | BibTeX

Basich, Connor; Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Introspective Competence Modeling for AV Decision Making Miscellaneous

2022, (US Patent 11,307,585).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan; Zilberstein, Shlomo

Multiple Objective Explanation and Control Interface Design Miscellaneous

2022, (US Patent 11,300,957).

Abstract | Links | BibTeX

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning Via Introspective Perception Journal Article

In: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3218–3225, 2022.

Abstract | Links | BibTeX

Ostafew, Christopher; Vagadia, Astha; Baig, Najamuddin; James, Viju; Witwicki, Stefan; Zilberstein, Shlomo

Exception Situation Playback for Tele-Operators Miscellaneous

2022, (US Patent 11,215,987).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Bentahar, Omar; Jamgochian, Arec

Explainability of Autonomous Vehicle Decision Making Miscellaneous

2021, (US Patent App. 16/778,890).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Reinforcement and Model Learning for Vehicle Operation Miscellaneous

2021, (US Patent 11,027,751).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Risk Aware Executor with Action Set Recommendations Miscellaneous

2021, (US Patent App. 16/696,235).

Abstract | Links | BibTeX

Basich, Connor; Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Introspective Competence Modeling for AV Decision Making Miscellaneous

2021, (US Patent App. 16/668,584).

Abstract | Links | BibTeX

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning via Introspective Perception Journal Article

In: CoRR, vol. abs/2109.13974, 2021.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Multiple Objective Explanation and Control Interface Design Miscellaneous

2021, (US Patent App. 16/727,038).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Shared Autonomous Vehicle Operational Management Miscellaneous

2021, (US Patent App. 16/955,531).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Centralized Shared Autonomous Vehicle Operational Management Miscellaneous

2021, (US Patent App. 16/955,531).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Autonomous Vehicle Operation with Explicit Occlusion Reasoning Miscellaneous

2021, (US Patent App. 16/753,601).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Learning Safety and Human-Centered Constraints in Autonomous Vehicles Miscellaneous

2021, (US Patent App. 16/724,635).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Objective-Based Reasoning in Autonomous Vehicle Decision-Making Miscellaneous

2021, (US Patent App. 16/695,613).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Pedersen, Liam

Autonomous Vehicle Operational Management Control Miscellaneous

2020, (US Patent 10,654,476).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Pedersen, Liam

Autonomous Vehicle Operational Management Including Operating A Partially Observable Markov Decision Process Model Instance Miscellaneous

2020, (US Patent App. 16/473,148).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Pedersen, Liam

Autonomous Vehicle Operational Management Blocking Monitoring Miscellaneous

2020, (US Patent App. 16/473,037).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Cefkin, Melissa

Orientation-Adjust Actions for Autonomous Vehicle Operational Management Miscellaneous

2020, (US Patent App. 16/023,710).

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan J; Biswas, Joydeep; Zilberstein, Shlomo

Learning to Optimize Autonomy in Competence-Aware Systems Conference

Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Auckland, New Zealand, 2020.

Abstract | Links | BibTeX

Miura, Shuwa; Zilberstein, Shlomo

Maximizing Plan Legibility in Stochastic Environments (Extended Abstract) Conference

Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Auckland, New Zealand, 2020.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Kamar, Ece; Zilberstein, Shlomo

Mitigating the Negative Side Effects of Reasoning with Imperfect Models: A Multi-Objective Approach (Extended Abstract) Conference

Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Auckland, New Zealand, 2020.

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan J; Biswas, Joydeep; Zilberstein, Shlomo

Learning to Optimize Autonomy in Competence-Aware Systems Journal Article

In: CoRR, vol. abs/2003.07745, 2020.

Abstract | Links | BibTeX

Parr, Shane; Khatri, Ishan; Svegliato, Justin; Zilberstein, Shlomo

Agent-Aware State Estimation: Effective Traffic Light Classification for Autonomous Vehicles Conference

ICRA 2020 Workshop on Sensing, Estimating and Understanding the Dynamic World, 2020.

Abstract | Links | BibTeX

Svegliato, Justin; Witwicki, Stefan J; Wray, Kyle Hollins; Zilberstein, Shlomo

Introspective Autonomous Vehicle Operational Management Miscellaneous

2020, (US Patent 10,649,453).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Policy Networks: A Framework for Scalable Integration of Multiple Decision-Making Models (Extended Abstract) Conference

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Montreal, Quebec, CA, 2019.

Abstract | Links | BibTeX

Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan J; Biswas, Joydeep; Zilberstein, Shlomo

Belief Space Metareasoning for Exception Recovery Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo; Pedersen, Liam

Autonomous Vehicle Operational Management Control Miscellaneous

2019, (US Patent App. 16/472,573).

Abstract | Links | BibTeX

Wray, Kyle Hollins; Shaw, Julie A.; Stone, Peter; Witwicki, Stefan J.; Zilberstein, Shlomo (Ed.)

Proceedings of the AAAI Fall Symposium on Reasoning and Learning in Real-World Systems for Long-Term Autonomy Proceedings

Arlington, VA, 2018.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Zilberstein, Shlomo

Policy Networks for Reasoning in Long-Term Autonomy Conference

AAAI Fall Symposium on Reasoning and Learning in Real-World Systems for Long-Term Autonomy (LTA), Arlington, Virginia, 2018.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Witwicki, Stefan J; Zilberstein, Shlomo

Online Decision Making for Scalable Autonomous Systems Conference

Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), 2017.

Abstract | Links | BibTeX

Wray, Kyle Hollins; Pineda, Luis Enrique; Zilberstein, Shlomo

Hierarchical Approach to Transfer of Control in Semi-Autonomous Systems Conference

Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI), New York, NY, 2016.

Abstract | Links | BibTeX

Zilberstein, Shlomo

Building Strong Semi-Autonomous Systems Conference

Proceedings of the 29th Conference on Artificial Intelligence (AAAI), Austin, Texas, 2015.

Abstract | Links | BibTeX

Mouaddib, Abdel-Illah; Jeanpierre, Laurent; Zilberstein, Shlomo

Handling Advice in MDPs for Semi-Autonomous Systems Conference

ICAPS Workshop on Planning and Robotics (PlanRob), Jerusalem, Israel, 2015.

Abstract | Links | BibTeX

Mouaddib, Abdel-Illah; Zilberstein, Shlomo; Beynier, Aurelie; Jeanpierre, Laurent

A Decision-Theoretic Approach to Cooperative Control and Adjustable Autonomy Conference

Proceedings of the 9th European Conference on Artificial Intelligence (ECAI), Lisbon, Portugal, 2010.

Abstract | Links | BibTeX

Building Safe AI systems

How can we create AI systems that are safe, transparent, and ethical?

Show Related Publications

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Choudhury, Moumita; Saisubramanian, Sandhya; Zhang, Hao; Zilberstein, Shlomo

Minimizing Negative Side Effects in Cooperative Multi-Agent Systems Using Distributed Coordination Conference

Proceedings of the The 37th International FLAIRS Conference, Miramar Beach, Florida, 2024.

Abstract | Links | BibTeX

Basich, Connor; Svegliato, Justin; Wray, Kyle Hollins; Witwicki, Stefan; Biswas, Joydeep; Zilberstein, Shlomo

Competence-Aware Systems Journal Article

In: Artificial Intelligence (AIJ), iss. 316, pp. 103844, 2023.

Abstract | Links | BibTeX

@article{SZ:BSWWBZaij23,

title = {Competence-Aware Systems},

author = {Connor Basich and Justin Svegliato and Kyle Hollins Wray and Stefan Witwicki and Joydeep Biswas and Shlomo Zilberstein},

url = {http://rbr.cs.umass.edu/shlomo/papers/BSWWBZaij23.pdf},

doi = {10.1016/j.artint.2022.103844},

year  = {2023},

date = {2023-03-16},

urldate = {2023-03-16},

journal = {Artificial Intelligence (AIJ)},

issue = {316},

pages = {103844},

abstract = {Building autonomous systems for deployment in the open world has been a longstanding objective in both artificial intelligence and robotics. The open world, however, presents challenges that question some of the assumptions often made in contemporary AI models. Autonomous systems that operate in the open world face complex, non-stationary environments wherein enumerating all situations the system may face over the course of its deployment is intractable. Nevertheless, these systems are expected to operate safely and reliably for extended durations. Consequently, AI systems often rely on some degree of human assistance to mitigate risks while completing their tasks, and are hence better treated as semi-autonomous systems. In order to reduce unnecessary reliance on humans and optimize autonomy, we propose a novel introspective planning model—competence-aware systems (CAS)—that enables a semi-autonomous system to reason about its own competence and allowed level of autonomy by leveraging human feedback or assistance. A CAS learns to adjust its level of autonomy based on experience and interactions with a human authority so as to reduce improper reliance on the human and optimize the degree of autonomy it employs in any given circumstance. To handle situations in which the initial CAS model has insufficient state information to properly discriminate feedback received from humans, we introduce a methodology called iterative state space refinement that gradually increases the granularity of the state space online. The approach exploits information that exists in the standard CAS model and requires no additional input from the human. The result is an agent that can more confidently predict the correct feedback from the human authority in each level of autonomy, enabling it learn its competence in a larger portion of the state space.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Kamath, Aishwarya; Saisubramanian, Sandhya; Paruchuri, Praveen; Kumar, Akshat; Zilberstein, Shlomo

Planning and Learning for Non-Markovian Negative Side Effects Using Finite State Controllers Conference

Proceedings of the 37th Conference on Artificial Intelligence (AAAI), 2023.

Abstract | Links | BibTeX

Basich, Connor; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Autonomy: An Essential Skill for Robots in the Real World Conference

Proceedings of the 37th Conference on Artificial Intelligence (AAAI) Bridge Program, 2023.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Basich, Connor; Zilberstein, Shlomo

Semi-Autonomous Systems with Contextual Competence Awareness Conference

Proceedings of the 22nd International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), 2023.

Abstract | Links | BibTeX

Nashed, Samer B.; Mahmud, Saaduddin; Goldman, Claudia V.; Zilberstein, Shlomo

Causal Explanations for Sequential Decision Making Under Uncertainty (Extended Abstract) Conference

Proceedings of the 22nd International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), 2023.

Abstract | Links | BibTeX

Mahmud, Saaduddin; Saisubramanian, Sandhya; Zilberstein, Shlomo

Explanation-Guided Reward Alignment Conference

Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI), 2023.

Abstract | Links | BibTeX

Basich, Connor; Mahmud, Sadduddin; Zilberstein, Shlomo

Learning Constraints on Autonomous Behavior from Proactive Feedback Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, Michigan, 2023.

Abstract | Links | BibTeX

Nakamura, Mason; Svegliato, Justin; Nashed, Samer B.; Zilberstein, Shlomo; Russell, Stuart

Formal Composition of Robotic Systems as Contract Programs Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, Michigan, 2023.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo; Kamar, Ece

Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems Journal Article

In: AI Magazine, vol. 42, no. 4, pp. 62–71, 2022.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo; Kamar, Ece

Avoiding Negative Side Effects of Autonomous Systems in the Open World Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 74, pp. 143–177, 2022.

Abstract | Links | BibTeX

@article{SZ:SZKjair22,

title = {Avoiding Negative Side Effects of Autonomous Systems in the Open World},

author = {Sandhya Saisubramanian and Shlomo Zilberstein and Ece Kamar},

url = {https://www.jair.org/index.php/jair/article/view/13581/26799},

doi = {10.1613/jair.1.13581},

year  = {2022},

date = {2022-01-01},

urldate = {2022-01-01},

journal = {Journal of Artificial Intelligence Research (JAIR)},

volume = {74},

pages = {143--177},

abstract = {Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning Via Introspective Perception Journal Article

In: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3218–3225, 2022.

Abstract | Links | BibTeX

Svegliato, Justin; Basich, Connor; Saisubramanian, Sandhya; Zilberstein, Shlomo

Metareasoning for Safe Decision Making in Autonomous Systems Conference

Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Philadelphia, Pennsylvania, 2022.

Abstract | Links | BibTeX

Basich, Connor; Russino, Joseph A.; Chien, Steve; Zilberstein, Shlomo

A Sampling Based Approach to Robust Planning for a Planetary Lander Conference

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022.

Abstract | Links | BibTeX

Svegliato, Justin; Nashed, Samer B; Zilberstein, Shlomo

Ethically Compliant Sequential Decision Making Conference

Proceedings of the 35th Conference on Artificial Intelligence (AAAI), 2021, (Distinguished Paper Award).

Abstract | Links | BibTeX

Galhotra, Sainyam; Saisubramanian, Sandhya; Zilberstein, Shlomo

Learning to Generate Fair Clusters from Demonstrations Conference

Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021.

Abstract | Links | BibTeX

Nashed, Samer B; Svegliato, Justin; Zilberstein, Shlomo

Ethically Compliant Planning within Moral Communities Conference

Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Roberts, Shannon C; Zilberstein, Shlomo

Understanding User Attitudes Towards Negative Side Effects of AI Systems Conference

CHI Conference on Human Factors in Computing Systems, Late-Breaking Work, 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Mitigating Negative Side Effects via Environment Shaping (Extended Abstract) Conference

Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Mitigating Negative Side Effects via Environment Shaping Journal Article

In: CoRR, vol. abs/2102.07017, 2021.

Abstract | Links | BibTeX

Rabiee, Sadegh; Basich, Connor; Wray, Kyle Hollins; Zilberstein, Shlomo; Biswas, Joydeep

Competence-Aware Path Planning via Introspective Perception Journal Article

In: CoRR, vol. abs/2109.13974, 2021.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Kamar, Ece; Zilberstein, Shlomo

A Multi-Objective Approach to Mitigate Negative Side Effects Conference

Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI), 2020, (Distinguished Paper Award).

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Galhotra, Sainyam; Zilberstein, Shlomo

Balancing the Tradeoff Between Clustering Value and Interpretability Conference

Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society (AIES), New York, NY, 2020.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Minimizing the Negative Side Effects of Planning with Reduced Models Conference

AAAI Workshop on Artificial Intelligence Safety, Honolulu, Hawaii, 2019.

Abstract | Links | BibTeX

Saisubramanian, Sandhya; Zilberstein, Shlomo

Safe Reduced Models for Probabilistic Planning Conference

ICML/IJCAI/AAMAS Workshop on Planning and Learning (PAL), Stockholm, Sweden, 2018.

Abstract | Links | BibTeX

Freedman, Richard G; Zilberstein, Shlomo

Safety in AI-HRI: Challenges Complementing User Experience Quality Conference

AAAI Fall Symposium on Artificial Intelligence and Human-Robot Interaction (AI-HRI), Arlington, Virginia, 2016.

Abstract | Links | BibTeX

Plan and Activity Recognition

How can agents recognize the plans, activities, and intents of other agents and use that information to plan their response?

Show Related Publications

Miura, Shuwa; Zilberstein, Shlomo

Observer-Aware Planning with Implicit and Explicit Communication Conference

Proceedings of the The 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Auckland, New Zealand, 2024.

Abstract | Links | BibTeX

Nashed, Samer; Zilberstein, Shlomo

A Survey of Opponent Modeling in Adversarial Domains Journal Article

In: Journal of Artificial Intelligence Research (JAIR), vol. 73, pp. 277–327, 2022.

Abstract | Links | BibTeX

Miura, Shuwa; Cohen, Andrew L; Zilberstein, Shlomo

Maximizing Legibility in Stochastic Environments Conference

Proceedings of the 30th IEEE International Conference on Robot & Human Interactive Communication, (RO-MAN), Vancouver, BC, Canada, 2021.

Abstract | Links | BibTeX

Miura, Shuwa; Zilberstein, Shlomo

A Unifying Framework for Observer-Aware Planning and its Complexity Conference

Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence (UAI), Virtual Event, 2021.

Abstract | Links | BibTeX

Wayllace, Christabel; Keren, Sarah; Gal, Avigdor; Karpas, Erez; Yeoh, William; Zilberstein, Shlomo

Accounting for Observer's Partial Observability in Stochastic Goal Recognition Design Conference

Proceedings of the 24th European Conference on Artificial Intelligence (ECAI), 2020.

Abstract | Links | BibTeX

Dwaraki, Abhishek; Freedman, Richard G; Zilberstein, Shlomo; Wolf, Tilman

Using Natural Language Constructs and Concepts to Aid Network Management Conference

Proceedings of the International Conference on Computing, Networking and Communications, Honolulu, Hawaii, 2019.

Abstract | Links | BibTeX

Keren, Sarah; Pineda, Luis Enrique; Gal, Avigdor; Karpas, Erez; Zilberstein, Shlomo

Responsive Planning and Recognition for Closed-Loop Interaction Conference

Proceedings of the 29th International Conference on Automated Planning and Scheduling (ICAPS), Berkeley, CA, 2019.

Abstract | Links | BibTeX

Freedman, Richard G; Fung, Yi Ren; Ganchin, Roman; Zilberstein, Shlomo

Towards Quicker Probabilistic Recognition with Multiple Goal Heuristic Search Conference

AAAI Workshop on Plan, Activity, and Intent Recognition (PAIR), New Orleans, Louisiana, 2018.

Abstract | Links | BibTeX

Freedman, Richard G; Zilberstein, Shlomo

Roles that Plan, Activity, and Intent Recognition with Planning Can Play in Games Conference

AAAI Workshop on Knowledge Extraction from Games (KEG), New Orleans, Louisiana, 2018.

Abstract | Links | BibTeX

Freedman, Richard G; Zilberstein, Shlomo

Integration of Planning with Recognition for Responsive Interaction Using Classical Planners Conference

Proceedings of the 31st Conference on Artificial Intelligence (AAAI), San Francisco, California, 2017.

Abstract | Links | BibTeX

Keren, Sarah; Pineda, Luis Enrique; Gal, Avigdor; Karpas, Erez; Zilberstein, Shlomo

Equi-Reward Utility Maximizing Design in Stochastic Environments Conference

Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, 2017.

Abstract | Links | BibTeX

Freedman, Richard G; Zilberstein, Shlomo

Automated Interpretations of Unsupervised Learning-Derived Clusters for Activity Recognition Conference

Ro-Man Workshop on Learning for Human-Robot Collaboration, Kobe, Japan, 2015.

Freedman, Richard G; Jung, Hee-Tae; Zilberstein, Shlomo

Temporal and Object Relations in Unsupervised Plan and Activity Recognition Conference

AAAI Fall Symposium on Artificial Intelligence and Human-Robot Interaction (AI-HRI), Arlington, Virginia, 2015.

Abstract | Links | BibTeX

Freedman, Richard G; Jung, Hee-Tae; Zilberstein, Shlomo

Plan and Activity Recognition from a Topic Modeling Perspective Conference

Proceedings of the 24thInternational Conference on Automated Planning and Scheduling (ICAPS), Portsmouth, New Hampshire, 2014.

Abstract | Links | BibTeX

Stochastic Network Design and Optimization

How to develop scalable algorithms to optimize diffusion processes and use them to control the spread of various phenomena such as information over a social network or species over fragmented landscape?

Show Related Publications

Wu, Xiaojian; Kumar, Akshat; Sheldon, Daniel; Zilberstein, Shlomo

Robust Optimization for Tree-Structured Stochastic Network Design Conference

Proceedings of the 31st Conference on Artificial Intelligence (AAAI), San Francisco, California, 2017, (Best Paper Award).

Abstract | Links | BibTeX

Wu, Xiaojian; Sheldon, Daniel; Zilberstein, Shlomo

Optimizing Resilience in Large Scale Networks Conference

Proceedings of the 30th Conference on Artificial Intelligence (AAAI), Phoenix, Arizona, 2016.

Abstract | Links | BibTeX

Wu, Xiaojian; Sheldon, Daniel; Zilberstein, Shlomo

Fast Combinatorial Algorithm for Optimizing the Spread of Cascades Conference

Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI), Buenos Aires, Argentina, 2015.

Abstract | Links | BibTeX

Wu, Xiaojian; Sheldon, Daniel; Zilberstein, Shlomo

Rounded Dynamic Programming for Tree-Structured Stochastic Network Design Conference

Proceedings of the 28th Conference on Artificial Intelligence (AAAI), Quebec City, Canada, 2014.

Abstract | Links | BibTeX

Wu, Xiaojian; Sheldon, Daniel; Zilberstein, Shlomo

Stochastic Network Design in Bidirected Trees Conference

Proceedings of the 28th Neural Information Processing Systems Conference (NIPS), Montreal, Canada, 2014.

Abstract | Links | BibTeX

Wu, Xiaojian; Kumar, Akshat; Sheldon, Daniel; Zilberstein, Shlomo

Parameter Learning for Latent Network Diffusion Conference

Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo; Toussaint, Marc

Message-Passing Algorithms for MAP Estimation Using DC Programming Conference

Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS), La Palma, Canary Islands, 2012.

Abstract | Links | BibTeX

Kumar, Akshat; Wu, Xiaojian; Zilberstein, Shlomo

Lagrangian Relaxation Techniques for Scalable Spatial Conservation Planning Conference

Proceedings of the 26th Conference on Artificial Intelligence (AAAI), Toronto, Canada, 2012.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

Message-Passing Algorithms for Quadratic Programming Formulations of MAP Estimation Conference

Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI), Barcelona, Spain, 2011.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

On Message-Passing, MAP Estimation in Graphical Models and DCOPs Conference

International Workshop on Distributed Constraint Reasoning (DCR), Barcelona, Spain, 2011.

Abstract | Links | BibTeX

Kumar, Akshat; Zilberstein, Shlomo

MAP Estimation for Graphical Models by Likelihood Maximization Conference

Proceedings of the 24th Neural Information Processing Systems Conference (NIPS), Vancouver, British Columbia, Canada, 2010.

Abstract | Links | BibTeX