On Tuesday, Meta revealed AudioCraft, a set of generative AI models that, according to the company, can create "high-quality and realistic" music from text.
AudioCraft consists of three of Meta's generative AI models: MusicGen, AudioGen, and EnCodec. Both MusicGen and AudioGen generate sound from text, with the former generating music and the latter generating specific audio and sound effects.
You can try the MusicGen demo on Hugging Face. In the prompt, you can describe any type of music you'd like to hear, from any era. Meta offers the example, "An 80s driving pop song with heavy drums and synth pads in the background."
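For readers who would rather run the same prompt locally than use the hosted demo, here is a minimal sketch. It rests on assumptions not stated in the announcement: that Meta's open-source `audiocraft` Python package is installed, and that the `facebook/musicgen-small` checkpoint is the one you want. The helper name `generate_clip`, the eight-second duration, and the output file name are illustrative only.

```python
# Sketch only. Assumes `pip install audiocraft` and the publicly released
# "facebook/musicgen-small" checkpoint; neither is named in the article.

# The example prompt Meta shared.
PROMPT = "An 80s driving pop song with heavy drums and synth pads in the background"


def generate_clip(prompt: str, duration_s: int = 8, out_stem: str = "musicgen_demo") -> str:
    """Generate one clip from a text prompt and write it to <out_stem>.wav.

    `generate_clip` is a hypothetical helper for illustration.
    """
    # Imported lazily so this file can be loaded without the heavy dependencies.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    model = MusicGen.get_pretrained("facebook/musicgen-small")
    model.set_generation_params(duration=duration_s)  # clip length in seconds

    # generate() takes a list of text descriptions and returns a batch of waveforms.
    wav = model.generate([prompt])

    # audio_write adds the ".wav" extension to the stem itself.
    audio_write(out_stem, wav[0].cpu(), model.sample_rate, strategy="loudness")
    return f"{out_stem}.wav"
```

Calling `generate_clip(PROMPT)` would download the model weights on first use and may be slow without a GPU, so the hosted demo is likely the quicker way to experiment.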
EnCodec is an audio codec composed of neural networks that compress audio and reconstruct the input signal. As part of the announcement, Meta released an improved version of EnCodec that allows for higher-quality music generation with fewer artifacts, according to the release.