You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was wondering whether you have a plan to share the code for semi-automated dataset generation (the pipeline of using Katna to extract keyframes -> using BLIP2 & GRIT to generate frame-wise captions -> filtering with Tag2Text). If not, is it possible to share the generated dense captions from these large vision models?
Thank you!
The text was updated successfully, but these errors were encountered:
Hi @mmaaz60, thanks for sharing this great work!
I was wondering whether you have a plan to share the code for semi-automated dataset generation (the pipeline of using Katna to extract keyframes -> using BLIP2 & GRIT to generate frame-wise captions -> filtering with Tag2Text). If not, is it possible to share the generated dense captions from these large vision models?
Thank you!
The text was updated successfully, but these errors were encountered: