Policy gradients with variance related risk criteria A Tamar, D Di Castro, S Mannor Proceedings of the twenty-ninth international conference on machine learning …, 2012 | 340 | 2012 |
Memristor-based multilayer neural networks with online gradient descent training D Soudry, D Di Castro, A Gal, A Kolodny, S Kvatinsky IEEE transactions on neural networks and learning systems 26 (10), 2408-2421, 2015 | 297 | 2015 |
Contextual markov decision processes A Hallak, D Di Castro, S Mannor arXiv preprint arXiv:1502.02259, 2015 | 233 | 2015 |
Learning the variance of the reward-to-go A Tamar, D Di Castro, S Mannor Journal of Machine Learning Research 17 (13), 1-36, 2016 | 86 | 2016 |
Policy gradients with variance related risk criteria D Di Castro, A Tamar, S Mannor arXiv preprint arXiv:1206.6404, 2012 | 68 | 2012 |
You've got mail, and here is what you could do with it! analyzing and predicting actions on email messages D Di Castro, Z Karnin, L Lewin-Eytan, Y Maarek Proceedings of the ninth acm international conference on web search and data …, 2016 | 61 | 2016 |
Temporal difference methods for the variance of the reward to go A Tamar, D Di Castro, S Mannor International Conference on Machine Learning, 495-503, 2013 | 56 | 2013 |
A convergent online single time scale actor critic algorithm DD Castro, R Meir The Journal of Machine Learning Research 11, 367-410, 2010 | 44 | 2010 |
Insertionnet-a scalable solution for insertion O Spector, D Di Castro IEEE Robotics and Automation Letters 6 (3), 5509-5516, 2021 | 39 | 2021 |
Enforcing k-anonymity in web mail auditing D Di Castro, L Lewin-Eytan, Y Maarek, R Wolff, E Zohar Proceedings of the ninth ACM international conference on web search and data …, 2016 | 30 | 2016 |
Model selection in markovian processes A Hallak, D Di-Castro, S Mannor Proceedings of the 19th ACM SIGKDD international conference on Knowledge …, 2013 | 30 | 2013 |
SOLO: search online, learn offline for combinatorial optimization problems J Oren, C Ross, M Lefarov, F Richter, A Taitler, Z Feldman, D Di Castro, ... Proceedings of the international symposium on combinatorial search 12 (1 …, 2021 | 28 | 2021 |
Adaptive bases for reinforcement learning D Di Castro, S Mannor Joint European Conference on Machine Learning and Knowledge Discovery in …, 2010 | 28 | 2010 |
Hand gesture recognition in images and video I Steinberg, TM London, D Di Castro Irwin and Joan Jacobs Center for Communication and Information Technologies …, 2010 | 22 | 2010 |
Temporal difference based actor critic learning-convergence and neural implementation D Castro, D Volkinshtein, R Meir Advances in neural information processing systems 21, 2008 | 22 | 2008 |
A hybrid approach for learning to shift and grasp with elaborate motion primitives Z Feldman, H Ziesche, NA Vien, D Di Castro 2022 International Conference on Robotics and Automation (ICRA), 6365-6371, 2022 | 16 | 2022 |
Insertionnet 2.0: Minimal contact multi-step insertion using multimodal multiview sensory input O Spector, V Tchuiev, D Di Castro 2022 International Conference on Robotics and Automation (ICRA), 6330-6336, 2022 | 15 | 2022 |
Structural clustering of machine-generated mail N Avigdor-Elgrabli, M Cwalinski, D Di Castro, I Gamzu, I Grabovitch-Zuyev, ... Proceedings of the 25th ACM International on Conference on Information and …, 2016 | 15 | 2016 |
Analog multiplier using a memristive device and method for implemening Hebbian learning rules using memrisor arrays D Di Castro, D Soudry, S Kvatinsky, A Gal, A Kolodny US Patent 9,754,203, 2017 | 13 | 2017 |
Method and apparatus for predicting unwanted electronic messages for a user L Lewin-Eytan, G Halawi, D Di Castro, Z Karnin, Y Maarek, M Albers US Patent 10,374,995, 2019 | 12 | 2019 |