The Scholarly Store is available here:
Using the power of deep learning, it is now possible to create a technique that looks at a silent video and synthesizes appropriate sound effects for it. The technique is, at the moment, limited to videos of objects being hit with a drumstick.
Note: The authors seem to lean on a database of sounds, i.e., the synthesis does not happen from scratch. However, they are not merely fetching the database entry for a given sound; they perform example-based synthesis (Section 5.2 in the paper below). Both the video and the paper use the terms “synthesized sound” and “predicted sound”, and it may be a bit unclear what degree of synthesis qualifies as a “synthesized sound”. I think this is definitely worthy of further scrutiny.
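To make the note above more concrete, here is a minimal Python sketch of what example-based synthesis can look like: a model predicts a sound-feature vector from the silent video, and the closest-matching recorded sound is retrieved from a database. The function names, the feature representation, and the database structure are assumptions for illustration, not the authors' actual implementation.

import numpy as np

def synthesize_sound(video_frames, predict_features, sound_database):
    # Hypothetical model call: a trained network maps video frames to a sound-feature vector.
    predicted = predict_features(video_frames)
    # Stack the precomputed feature vectors of all recorded sounds in the (assumed) database.
    features = np.stack([entry["features"] for entry in sound_database])
    # Nearest-neighbor retrieval: pick the recorded example whose features best match the prediction.
    distances = np.linalg.norm(features - predicted, axis=1)
    best_match = sound_database[int(np.argmin(distances))]
    # The output is a reused recorded waveform, not audio generated from scratch.
    return best_match["waveform"]

This is only one plausible reading of “example-based synthesis”; the key point is that the degree of synthesis lies somewhere between pure retrieval and generating the waveform from scratch.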
_____________________________________
The paper “Visually Indicated Sounds” is available here:
Recommended for you:
What Do Virtual Objects Sound Like? –
Synthesizing Sound From Collisions –
Reconstructing Sound From Vibrations –
Our deep learning-related videos are available here (covering topics such as convolutional neural networks and recurrent neural networks):
WE WOULD LIKE TO THANK OUR GENEROUS PATREON SUPPORTERS WHO MAKE TWO MINUTE PAPERS POSSIBLE:
David Jaenisch, Sunil Kim, Julian Josephs, Daniel John Benton.
We also thank Experiment for sponsoring our series. –
Subscribe if you would like to see more of these! –
The thumbnail background image was created by slgckgc –
Splash screen/thumbnail design: Felícia Fehér –
Károly Zsolnai-Fehér’s links:
Facebook →
Twitter →
Web →