Volunteer Oprotunity: AI Transcription generation for OpenStreetMapsUS
Posted by PhysicsArmature on 22 March 2023 in English. Last updated on 2 May 2023.Goal:
- Find a video on the OpenStreetMaps YouTube channel.
- Download the audio of a talk.
- Run it through OpenAI’s Whisper.
- Send the transcript and the source URL to somebody in the OSM Community who has ownership over the OSM YouTube channel.
While you may be able to automate this, I don’t know how to do so.
What you need:
- GPU (possibly NVIDIA, don’t know). 5gb vram (gpu ram). This might mean RTX 2060 or newer.
- Strong cooling and noise isolation through building design.
Costs
- Electricity will create some cost as transcription is hard. Do note that it is still less then the amount needed to power on and train a normal human being on the same task for several years in addition to the quantity of humans needed to get the same throughput.
- This will result in wear and tare on your drives and other components.
- This will make your computer and room warm in the summer. You need great cooling or the ability to use the excess heat for something valuable.
Steps:
- Install Itch.io to assist updating.
- Install whisper gui frontend by Grisk with Itch.
- Download audio from a talk (not saying how).
- Plug it in and get the result.
- Send the URL of the talk and the transcript to unknownPerson who runs the OSM YouTube Channel in a standard format.
Sample format for an email
Hello noun, This email is to submit a transcript.
talk: https://www.youtube.com/watch?v=nsaiHhQvNSY model: whisper medium
Disclaimers:
- I have yet to coordinate with anyone.
- Human transcript writers are great and needed. They are in short supply. Let us reduce the net demand. They can save their energy for high impact legal and medical environments.
- Maybe the built in YouTube transcript does the job well enough. This might not be worth the effort. I don’t know.
Discussion
Comment from The Wonderful Tartiflette on 24 March 2023 at 21:38
You can already do that without using your own hardware here : https://huggingface.co/spaces/sanchit-gandhi/whisper-large-v2
Comment from 快乐的老鼠宝宝 on 7 April 2023 at 09:32
Is there currently a dashboard/statistics of which videos have been transcribed?
Comment from PhysicsArmature on 8 April 2023 at 19:09
@ osm.org/user/%E5%BF%AB%E4%B9%90%E7%9A%84%E8%80%81%E9%BC%A0%E5%AE%9D%E5%AE%9D I am not aware of any dashboard.