Case Study on Decomposing an Agentic Framework for Video Moment Retrieval
Investigated an agentic framework for video moment retrieval by dissecting temporal queries using AST-driven structures and semantic operators. Evaluated LLM planners for reasoning over temporal relations and multi-event dependencies. Studied Adaptive Keyframe Sampling as a relevance-aware alternative to uniform sampling for long-video grounding. Developed hierarchical subqueries via spaCy dependency parsing to isolate core action predicates, attributes, and contextual modifiers for improved localization.
#Video Understanding #Temporal Grounding #Query Decomposition #Keyframe Sampling
2025