CLIP-Event: Connecting Vision and Text with Event Structures

test

CLIP-Event
Manling Li
Manling Li
Assistant Professor

I study reasoning and planning in multimodal foundation models.