For the end of the semester I’m thinking of using Detectron2 to create a novel tool for collage.
Right now, my idea is to take a YouTube link of one’s favorite cute animal video and use FFMPEG to pull frames from it, then run Detectron2 to mask those images, and finally mask the images (probably with GPU numpy). Then I hope to translate the images and make them into a collage from those results.
My goal for next week would be to take a segmented YouTube video and mask out the background as a test.