neural frames logo
Stems in Music: What They Are and How neural frames Extracts Them

Stems in Music: What They Are and How neural frames Extracts Them

Music production has come a long way, with various technological advances simplifying and revolutionizing the process. One such advance is the use of "stems" in music. In this article, we'll explore what stems are in music and how neural frames, an innovative AI music video generator, uses AI to extract them.

What Are Stems in Music?

Stems, in the context of music production, refer to the individual tracks or components that make up a song. These can include different instruments, vocals, and other sound elements. For instance, the stems of a rock song might include separate tracks for the vocals, guitar, bass, and drums.

Having access to the stems of a track gives producers and remixers more control over the music. They can adjust the volume of individual elements, add effects, or even remove components entirely.

Extracting Stems with neural frames

Neural frames takes the concept of stems one step further by leveraging AI to extract stems from any uploaded track. This innovative feature not only breaks down the track into its fundamental components but also categorizes them into specific groups such as snare, hi-hats, toms, kick, keys, vocals, and bass.

The stem extraction in neural frames is powered by advanced AI algorithms. When you upload a track, the AI gets to work analyzing the different audio elements. Once it identifies the distinct components, it isolates each one, effectively creating a set of stems from the original track.

The video editor of neural frames, showing not only the extracted stems, but also a modulation based on the snare. We also estimate the BPM of the song.

Why Extract Stems with neural frames?

With neural frames, you can do more than just create stems from your music tracks. You can use those extracted stems to modulate different visual parameters in your AI-generated music video.

For example, you could have the kick drum modulate the zoom of the camera, creating a video that pulses in time with the beat. Or, you could use the vocal track to control the color palette, making the visuals shift with each note.

By extracting and utilizing stems in this way, neural frames gives you unprecedented control over your music video's aesthetics. You're not just syncing visuals with your music – you're creating a unique audio-visual experience that responds directly to the nuances of your track. See for instance this squirrel, synced to the snare track of the song (The song has been AI-generated with Soundful)

Wrap Up

In the world of music production, stems play a vital role by providing control and flexibility. Neural frames leverages AI to bring the concept of stems into the realm of music video production, offering creators a powerful tool to create compelling, responsive visuals.

Whether you're an artist looking to create a music video for your latest track, a content creator aiming to generate engaging visuals, or a producer eager to experiment with AI, understanding stems and how neural frames extracts them could revolutionize your creative process. Embrace the future of music video creation with neural frames and see what this innovative technology can do for your music.

No VC money, just a physicist turned indiehacker in love with text-to-video. Contact me here: contact(at) This website is greatly inspired by Deforum, but doesn't actually use it. For inspiration on prompts, I recommend Civitai.

Use cases

AI Animation GenerationAI Music Video GeneratorTrippy VisualsAI Video EditorText to video