Saturday, 18 March 2023

New top story on Hacker News: Vid2Seq: A pretrained visual language model for describing multi-event videos

Vid2Seq: A pretrained visual language model for describing multi-event videos
16 by og_kalu | 3 comments on Hacker News.


No comments:

Post a Comment