TimeLens Sets a New Benchmark: Redefining Video Temporal Grounding with Multimodal Large Language Models
Highlights: Introduces TimeLens, a systematic rethinking of video temporal grounding (VTG) using multimodal large language models (MLLMs). Launches TimeLens-Bench, a high-quality benchmark featuring re-annotated datasets for fairer evaluation. Presents TimeLens-100K,…
