I was really impressed with SAM 2 when it first appeared, especially given how much I work with video. My company uses it heavily, so we decided to try to speed it up. We managed to make it run twice as fast as the original version!
Video models, unlike language models, have a reputation for being very slow, largely because their pipelines are inefficient at reading, storing, and writing files.