Personal Home Recording Studio: ACE-Step UI Review
Have you ever thought about how much money goes to Suno or Udio subscriptions? These services are cool, no doubt, but paying $10-20 per month for music generation, whose rights may also be limited by license terms, is a dubious pleasure. Not to mention that your prompts and audio are stored somewhere on someone else's servers.
Recently I stumbled upon the ace-step-ui project, which changes the game. It's an open-source interface for the ACE-Step 1.5 model. Basically, you get a full-fledged Suno alternative that runs locally on your GPU. Free, unlimited, and completely private.
What is it anyway
The project is a modern shell for a neural network that can generate full songs with vocals, instrumental and separate tracks. The stack consists of React 18, TypeScript and TailwindCSS on the frontend, while the backend runs on Node.js with an SQLite database.
The main feature here is accessibility. Usually, running similar neural networks turns into a quest "install 100500 Python dependencies and don't break your system". The author of ace-step-ui went a different way and offered several installation options, including one-click installation via Pinokio.
What it's good at in practice
If you've used web interfaces for audio generation from the 2010s, ACE-Step UI will pleasantly surprise you. The interface is clearly inspired by Spotify: dark theme, clean player at the bottom of the page, and a convenient track library.
Process control
Unlike cloud services, where you often press a button and hope for a miracle, here you can tweak the settings:
- BPM and key selection.
- Song structure configuration via tags like
[Verse]or[Chorus]. - "Thinking Mode" when the neural network first plans the structure and then generates the audio code. It's slower, but the result is head and shoulders above.
- Working with references. You can upload your own audio file and the model will try to copy its style or structure.
Built-in processing tools
This isn't just a "text to music" generator. The interface includes several useful utilities that you usually have to find separately:
- AudioMass audio editor for quick trimming and effects.
- Stem separation (Demucs). If you like the beat but don't need the vocals (or vice versa), the neural network will break the track down into components: drums, bass, vocals, and everything else.
- Video generator. You can attach a background from Pexels and get a ready-made clip for social media right away.
Technical nuances and hardware
Let's address the burning question right away: how much VRAM do you need? For basic work, a card with 4 GB of VRAM will suffice (for example, RTX 3050 or even an old 1050 Ti with some patience). But if you want to enable the smart mode with LLM (Thinking Mode), the requirements grow. Developers recommend 12 GB of VRAM for comfortable work with all the features, but the project can run the language model on the CPU, which saves owners of average PCs.
Interestingly, your entire generation database is stored locally in SQLite. No cloud, no sync. If the disk dies — the music goes with it, so backups are still a must.
How to run it
The laziest and most reliable way is Pinokio. If you prefer the terminal, the process looks something like this:
- You need to clone the ACE-Step 1.5 engine and install its dependencies. On Windows there's a portable version that weighs about 5 GB and already contains configured Python.
- Clone ace-step-ui itself.
- Launch two servers: one handles the neural network API, the second handles the interface and backend.
For Windows users, there are ready-made .bat files in the repository that automate the launch.
cd ace-step-ui
start-all.bat
After that, your personal studio will open at http://localhost:3000.
Is it worth trying
The project is definitely suitable for content creators: streamers, YouTubers, or indie game developers who need a unique soundtrack without royalty payments.
Yes, the generation quality sometimes may fall short of top paid services in terms of lyrics, but the ability to generate unlimited variations and immediately break them down into stems outweighs this downside. Plus, working locally over LAN allows you to put the server on a powerful PC in one room and brainstorm ideas from a tablet or laptop in another.
If you have an NVIDIA card and a couple of free evenings, ACE-Step UI is a great reason to try out modern AI production without hitting your wallet.
Related projects