GSoC : And now my watch begins!
· 3 min read
co-oCCur: High-speed subtitle synchronization
has been accepted by
for Google Summer of Code (GSoC) 2019!
I am thrilled to bits to spend my summer working with the “magicians” behind the de-facto subtitle tool - CCExtractor. My mentor is Carlos Fernandez Sans (org admin, who originally built CCExtractor). What more could one ask for?
Simply put, it is an international annual program by Google aimed at promoting
Open Source Software development
among college and university students. Students apply for a project/idea of their choice to one of the many
Open Source Organizations
selected by Google, by submitting a proposal. Selected students spend their summer building the project by implementing what they proposed.
In return, the students gain excellent experience of building a real-world project, mentored by seasoned developers. The stipend 💰 and of course the bragging rights! 😎
About my project
Out of the many ideas, present in the CCExtractor’s GSoC page, I gravitated to Writing high-Speed subtitle synchronization tool. After about a month of research on the same, I submitted my proposal on the 9th of April. Here is my accepted proposal!
Closed Captions (CC)
and subtitles enhance audiovisual content by providing speech information and description of representative events in a textual format.
Captioning is especially used as an aid for people with hearing loss or deafness, but its use is definitely not limited to that domain.
For instance, it is frequently the case that captions are necessary to watch a TV show in a noisy surrounding or when one is not familiar with the language or accent available in the audio streams.
Downloading a subtitle document and playing it alongside an episode of your favourite TV show, is not rocket science, but it sure can feel that way sometimes. Getting the subtitle document that gives satisfactory synchronization on the first attempt is like hitting the jackpot. I know the annoyance that comes in with misaligned subtitles and is a very general problem.
For an ideal subtitle file, the subtitles are perfectly aligned with the base audiovisual content. In other words, the audio and the corresponding subtitles co-oCCur.
The misalignment of the subtitle files is the underlying problem that this project aims to solve so that the viewer does not have any burden before the fun starts (this is what matters).
For the next 3 months, I will be working on this amazing project. I hope by the end of this summer the tool is ready to be used!
"Every skill you acquire, doubles your odds of success"