I am mostly on track with my project. I managed to make a Colab notebook based on Detectron2’s tutorial notebook which has:
- A widget text box that takes in a URL
- Uses youtube-dl to download the video from the URL
- Uses Detectron2 on (for now) one of the frames
- Outputs the mask based on the images (as a numpy array which can be used with openCV)
What I need to do next is:
- Differentiate what segmentation info I care about rather than all info at once
- Use the mask to segment my photo
- Collage the output photos
- Create logic to protect against bad input
- Create a front end interface
I am still deciding on what I want the output collages style to be.