Towards multimodal in-context learning for vision & language models S Doveh, S Perek, MJ Mirza, A Alfassy, A Arbelle, S Ullman, L Karlinsky arXiv preprint arXiv:2403.12736, 2024 | 2 | 2024 |
Dense and aligned captions (dac) promote compositional reasoning in vl models S Doveh, A Arbelle, S Harary, R Herzig, D Kim, P Cascante-Bonilla, ... Advances in Neural Information Processing Systems 36, 2024 | 19 | 2024 |
Efficient Rehearsal Free Zero Forgetting Continual Learning using Adaptive Weight Modulation Y Sverdlov, S Ullman arXiv preprint arXiv:2311.15276, 2023 | | 2023 |
Variable resolution: improving scene visual question answering with a limited pixel budget A Gizdov, S Ullman, D Harari | | 2023 |
Human-like scene interpretation by a guided counterstream processing S Ullman, L Assif, A Strugatski, BZ Vatashsky, H Levi, A Netanyahu, ... Proceedings of the National Academy of Sciences 120 (40), e2211179120, 2023 | | 2023 |
Top-Down Processing: Top-Down Network Combines Back-Propagation with Attention R Abel, S Ullman arXiv preprint arXiv:2306.02415, 2023 | | 2023 |
Attention Based Multi-Label Classification of Diabetic Retinopathy from Optical Coherence Tomography D Segev, R Basri, T Batash, I Chowers, D Harari, R Lender, J Levi, ... 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), 1-5, 2023 | | 2023 |
Teaching structured vision & language concepts to vision & language models S Doveh, A Arbelle, S Harary, E Schwartz, R Herzig, R Giryes, R Feris, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 41 | 2023 |
The dynamics of scene understanding D Harari, A Mars, H Benoni, S Ullman Journal of Vision 22 (14), 3555-3555, 2022 | 1 | 2022 |
Augmented reality display system and method S Ullman, D Harari, L Assif, I Koifman US Patent 11,368,670, 2022 | 1 | 2022 |
Gaze following requires early visual experience E Zohary, D Harari, S Ullman, I Ben-Zion, R Doron, S Attias, Y Porat, ... Proceedings of the National Academy of Sciences 119 (20), e2117184119, 2022 | 10 | 2022 |
A model for full local image interpretation G Ben-Yosef, L Assif, D Harari, S Ullman arXiv preprint arXiv:2110.08744, 2021 | 7 | 2021 |
Machine Recognition of Objects T Poggio, S Ullman Computer Vision: A Reference Guide, 781-784, 2021 | 1 | 2021 |
Multi-task learning by a top-down control network H Levi, S Ullman 2021 IEEE International Conference on Image Processing (ICIP), 2553-2557, 2021 | 8 | 2021 |
Oculo-retinal dynamics can explain the perception of minimal recognizable configurations LZ Gruber, S Ullman, E Ahissar Proceedings of the National Academy of Sciences 118 (34), e2022792118, 2021 | 7 | 2021 |
Image interpretation by iterative bottom-up top-down processing S Ullman, L Assif, A Strugatski, BZ Vatashsky, H Levy, A Netanyahu, ... arXiv preprint arXiv:2105.05592, 2021 | 1 | 2021 |
What can human minimal videos tell us about dynamic recognition models? G Ben-Yosef, G Kreiman, S Ullman arXiv preprint arXiv:2104.09447, 2021 | | 2021 |
View-tuned and view-invariant face encoding in IT cortex is explained by selected natural image fragments Y Nam, T Sato, G Uchida, E Malakhova, S Ullman, M Tanifuji Scientific reports 11 (1), 7827, 2021 | 5 | 2021 |
Detector-free weakly supervised grounding by separation A Arbelle, S Doveh, A Alfassy, J Shtok, G Lev, E Schwartz, H Kuehne, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 21 | 2021 |
Object recognition at the level of minimal images develops for up to seconds of presentation time D Harari, H Benoni, S Ullman Journal of Vision 20 (11), 266-266, 2020 | | 2020 |