2023

  • CALM: Conditional Adversarial Latent Models for Directable Virtual Characters

    Tessler, C., Kasten, Y., Guo, Y., Mannor, S., Chechik, G. & Peng, X. B., 23 Jul 2023, Proceedings - SIGGRAPH 2023 Conference Papers. Spencer, S. N. (ed.). 37. (Proceedings - SIGGRAPH 2023 Conference Papers).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

    Fuhrer, B., Shpigelman, Y., Tessler, C., Mannor, S., Chechik, G., Zahavi, E. & Dalal, G., 2023, Proceedings - 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2023. Simmhan, Y., Altintas, I., Varbanescu, A.-L., Balaji, P., Prasad, A. S. & Carnevale, L. (eds.). p. 331-343 13 p. (Proceedings - 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2023).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Optimization or Architecture: How to Hack Kalman Filtering

    Greenberg, I., Yannay, N. & Mannor, S., 2023, Advances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023. Oh, A., Naumann, T., Globerson, A., Saenko, K., Hardt, M. & Levine, S. (eds.). (Advances in Neural Information Processing Systems; vol. 36).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Optimization or Architecture: What Matters in Non-Linear Filtering?

    Greenberg, I., Yannay, N. & Mannor, S., 2023, 1st Workshop on the Synergy of Scientific and Machine Learning Modeling @ ICML2023.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Planning and Learning with Adaptive Lookahead

    Rosenberg, A., Hallak, A., Mannor, S., Chechik, G. & Dalal, G., 27 Jun 2023, AAAI-23 Technical Tracks 8. Williams, B., Chen, Y. & Neville, J. (eds.). p. 9606-9613 8 p. (Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023; vol. 37).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Towards Faster Global Convergence of Robust Policy Gradient Methods

    Kumar, N., Usmanova, I., Levy, K. Y. & Mannor, S., 2023, Sixteenth European Workshop on Reinforcement Learning. 13 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2022

  • Efficient Risk-Averse Reinforcement Learning

    Greenberg, I., Chow, Y., Ghavamzadeh, M. & Mannor, S., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Finite Sample Analysis Of Dynamic Regression Parameter Learning

    Kozdoba, M., Moroshko, E., Mannor, S. & Crammer, K., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Kalman Filter Is All You Need: Optimization Works When Noise Estimation Fails

    Greenberg, I., Mannor, S. & Yannay, N., 2022, Tenth International Conference on Learning Representations.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning

    Zohar, R., Mannor, S. & Tennenholtz, G., 30 Jun 2022, AAAI-22 Technical Tracks 8. p. 9278-9285 8 p. (Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022; vol. 36).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Online Apprenticeship Learning

    Shani, L., Zahavy, T. & Mannor, S., 30 Jun 2022, AAAI-22 Technical Tracks 8. p. 8240-8248 9 p. (Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022; vol. 36).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Reinforcement Learning for Datacenter Congestion Control

    Tessler, C., Shpigelman, Y., Dalal, G., Mandelbaum, A., Kazakov, D. H., Fuhrer, B., Chechik, G. & Mannor, S., 30 Jun 2022, IAAI-22, EAAI-22, AAAI-22 Special Programs and Special Track, Student Papers and Demonstrations. p. 12615-12621 7 p. (Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022; vol. 36).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Reinforcement Learning with a Terminator

    Tennenholtz, G., Merlis, N., Shani, L., Mannor, S., Shalit, U., Chechik, G., Hallak, A. & Dalal, G., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Tractable Optimality in Episodic Latent MABs

    Kwon, J., Efroni, Y., Caramanis, C. & Mannor, S., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Two Regimes of Generalization for Non-Linear Metric Learning

    Kozdoba, M. & Mannor, S., 2022, Tenth International Conference on Learning Representations. 11 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning

    Tennenholtz, G. & Mannor, S., 2022, Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022. Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K. & Oh, A. (eds.). (Advances in Neural Information Processing Systems; vol. 35).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2021

  • Action Redundancy in Reinforcement Learning

    Baram, N., Tennenholtz, G. & Mannor, S., 2021, 37th Conference on Uncertainty in Artificial Intelligence, UAI 2021. p. 376-385 10 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Bandits with partially observable confounded data

    Tennenholtz, G., Shalit, U., Mannor, S. & Efroni, Y., 2021, Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, UAI 2021, Virtual Event, 27-30 July 2021. Campos, C. P. D., Maathuis, M. H. & Quaeghebeur, E. (eds.). Vol. 161. p. 430-439 10 p. (Proceedings of Machine Learning Research).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Confidence-Budget Matching for Sequential Budgeted Learning

    Efroni, Y., Merlis, N., Saha, A. & Mannor, S., 2021, Proceedings of the 38th International Conference on Machine Learning, ICML 2021. p. 2937-2947 11 p. (Proceedings of Machine Learning Research; vol. 139).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

    Meirom, E. A., Maron, H., Mannor, S. & Chechik, G., 2021, Proceedings of the 38th International Conference on Machine Learning, ICML 2021. p. 7565-7577 13 p. (Proceedings of Machine Learning Research; vol. 139).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Detecting Rewards Deterioration in Episodic Reinforcement Learning

    Greenberg, I. & Mannor, S., 2021, Proceedings of the 38th International Conference on Machine Learning, ICML 2021. p. 3842-3853 12 p. (Proceedings of Machine Learning Research; vol. 139).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering

    Greenberg, I. & Mannor, S., 2021, ICLR 2021. 13 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

    Hallak, A., Dalal, G., Dalton, S., Frosio, I., Mannor, S. & Chechik, G., 2021, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P. S. & Wortman Vaughan, J. (eds.). p. 5518-5530 13 p. (Advances in Neural Information Processing Systems; vol. 7).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Latent Geodesics of Model Dynamics for Offline Reinforcement Learning

    Tennenholtz, G., Baram, N. & Mannor, S., 2021, Deep RL Workshop NeurIPS 2021. 21 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Learning Safe Policies with Cost-sensitive Advantage Estimation

    Kang, B., Mannor, S. & Feng, J., 2021, ICLR 2021 Conference.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Lenient Regret for Multi-Armed Bandits

    Merlis, N. & Mannor, S., 2021, 35th AAAI Conference on Artificial Intelligence, AAAI 2021. p. 8950-8957 8 p. (35th AAAI Conference on Artificial Intelligence, AAAI 2021; vol. 10B).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

    Tennenholtz, G., Hallak, A., Dalal, G., Mannor, S., Chechik, G. & Shalit, U., 2021, International Conference on Learning Representations.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Online Limited Memory Neural-Linear Bandits with Likelihood Matching

    Nabati, O., Zahavy, T. & Mannor, S., 2021, Proceedings of the 38th International Conference on Machine Learning, ICML 2021. p. 7905-7915 11 p. (Proceedings of Machine Learning Research; vol. 139).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • On the Volatility of Optimal Control Policies of a Class of Linear Quadratic Regulators

    Mohan, A., Mannor, S. & Kizilkale, A. C., 25 May 2021, 2021 American Control Conference, ACC 2021. p. 4533-4540 8 p. 9482645. (Proceedings of the American Control Conference; vol. 2021-May).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning

    Khadka, S., Aflalo, E., Marder, M., Ben-David, A., Miret, S., Mannor, S., Hazan, T., Tang, H. & Majumdar, S., 2021, 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Over-the-Air Adversarial Flickering Attacks against Video Recognition Networks

    Pony, R., Naeh, I. & Mannor, S., 2021, Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021. p. 515-524 10 p. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Reinforcement Learning in Reward-Mixing MDPs

    Kwon, J., Caramanis, C., Efroni, Y. & Mannor, S., 2021, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P. S. & Wortman Vaughan, J. (eds.). p. 2253-2264 12 p. (Advances in Neural Information Processing Systems; vol. 3).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Reinforcement Learning with Trajectory Feedback

    Efroni, Y., Merlis, N. & Mannor, S., 2021, 35th AAAI Conference on Artificial Intelligence, AAAI 2021. p. 7288-7295 8 p. (35th AAAI Conference on Artificial Intelligence, AAAI 2021; vol. 8B).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • RL for Latent MDPs: Regret Guarantees and a Lower Bound

    Kwon, J., Efroni, Y., Caramanis, C. & Mannor, S., 2021, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P. S. & Wortman Vaughan, J. (eds.). p. 24523-24534 12 p. (Advances in Neural Information Processing Systems; vol. 29).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Robust Value Iteration for Continuous Control Tasks

    Lutter, M., Mannor, S., Peters, J., Fox, D. & Garg, A., 2021, Robotics: Science and Systems XVII. Shell, D. A., Toussaint, M. & Hsieh, M. A. (eds.). (Robotics: Science and Systems).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Sim and Real: Better Together

    Di Castro Shashua, S., Mannor, S. & Di Castro, D., 2021, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P. S. & Wortman Vaughan, J. (eds.). p. 6868-6880 13 p. (Advances in Neural Information Processing Systems; vol. 9).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Twice regularized MDPs and the equivalence between robustness and regularization

    Derman, E., Geist, M. & Mannor, S., 2021, Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021. Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P. S. & Wortman Vaughan, J. (eds.). p. 22274-22287 14 p. (Advances in Neural Information Processing Systems; vol. 27).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Value Iteration in Continuous Actions, States and Time

    Lutter, M., Mannor, S., Peters, J., Fox, D. & Garg, A., 2021, Proceedings of the 38th International Conference on Machine Learning, ICML 2021. p. 7224-7234 11 p. (Proceedings of Machine Learning Research; vol. 139).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2020

  • Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs

    Shani, L., Efroni, Y. & Mannor, S., 2020, AAAI 2020 - 34th AAAI Conference on Artificial Intelligence. p. 5668-5675 8 p. (AAAI 2020 - 34th AAAI Conference on Artificial Intelligence).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Contextual Inverse Reinforcement Learning

    Korsunsky, P., Belogolovsky, S., Zahavy, T., Tessler, C. & Mannor, S., 2020, Eighth International Conference on Learning Representations. 23 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching

    Zahavy, T. & Mannor, S., 2020, Eighth International Conference on Learning Representations.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Off-policy evaluation in partially observable environments

    Tennenholtz, G., Mannor, S. & Shalit, U., 2020, AAAI 2020 - 34th AAAI Conference on Artificial Intelligence. p. 10276-10283 8 p. (AAAI 2020 - 34th AAAI Conference on Artificial Intelligence).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Open Access
  • Optimistic policy optimization with bandit feedback

    Efroni, Y., Shani, L., Rosenberg, A. & Mannor, S., 2020, 37th International Conference on Machine Learning, ICML 2020. Daume, H. & Singh, A. (eds.). p. 8562-8571 10 p. (37th International Conference on Machine Learning, ICML 2020; vol. PartF168147-12).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Scalable detection of offensive and non-compliant content / logo in product images

    Gandhi, S., Kokkula, S., Chaudhuri, A., Magnani, A., Stanley, T., Ahmadi, B., Kandaswamy, V., Ovenc, O. & Mannor, S., Mar 2020, Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020. p. 2236-2245 10 p. 9093454. (Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients

    Tessler, C., Merlis, N. & Mannor, S., 2020, Eighth International Conference on Learning Representations. 16 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Topic modeling via full dependence mixtures

    Fisher, D., Kozdoba, M. & Mannor, S., 2020, 37th International Conference on Machine Learning, ICML 2020. Daume, H. & Singh, A. (eds.). p. 3169-3179 11 p. (37th International Conference on Machine Learning, ICML 2020; vol. PartF168147-5).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2019

  • A Bayesian approach to robust reinforcement learning

    Derman, E., Mankowitz, D., Mann, T. & Mannor, S., 2019, 35th Conference on Uncertainty in Artificial Intelligence, UAI 2019.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Action robust reinforcement learning and applications in continuous control

    Tessler, C., Efroni, Y. & Mannor, S., 2019, 36th International Conference on Machine Learning, ICML 2019. p. 10846-10855 10 p. (36th International Conference on Machine Learning, ICML 2019; vol. 2019-June).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • Exploration conscious reinforcement learning revisited

    Shani, L., Efroni, Y. & Mannor, S., 2019, 36th International Conference on Machine Learning, ICML 2019. p. 9986-10012 27 p. (36th International Conference on Machine Learning, ICML 2019; vol. 2019-June).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

  • How to combine tree-search methods in reinforcement learning

    Efroni, Y., Dalal, G., Scherrer, B. & Mannor, S., 2019, 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019. p. 3494-3501 8 p. (33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review