Cataloguing a vinyl collection with computer vision
James contemplated what many folks do: I really should make a list of all of my vinyl records.
I decided to build a vinyl cataloguing tool powered by computer vision. The tool allows you to set up a webcam and saves every frame where a unique vinyl record is found. Those frames are then sent to ChatGPT to retrieve meta information about the album. Finally, the results are saved in a CSV file.
With this idea in mind, I started to think about how I could build it. I could use an LLM that accepts video inputs, although I was worried about records getting missed. I wanted a system where, if anything went wrong, I was able to interpret the results; if a vinyl could not be identified, I would rather have an error state than a missing record. Plus, I was not keen on the higher costs associated with having an entire video processed by an LLM, with all the redundant data that would be in the video.
The project demonstrates how an indexing system can be made using out-of-the-box foundation models: MobileCLIP and GPT-4 with Vision.
You can read the process used in the post here and the source code is available on GitHub.
Have an amazing project to share? The Electronics Show and Tell is every Wednesday at 7:30pm ET! To join, head over to YouTube and check out the show’s live chat and our Discord!
Python for Microcontrollers – Adafruit Daily — Python on Microcontrollers Newsletter: A New Arduino MicroPython Package Manager, How-Tos and Much More! #CircuitPython #Python #micropython @ThePSF @Raspberry_Pi
EYE on NPI – Adafruit Daily — EYE on NPI Maxim’s Himalaya uSLIC Step-Down Power Module #EyeOnNPI @maximintegrated @digikey