Guide: How To Create The Best VTuber Setup

April 26, 2022
10 min read
By
Rokoko

VTubers, also known as virtual YouTubers, use motion capture to create a real-time digital avatar that mimics their facial expressions and body movements. This guide will show you precisely what you’ll need to start VTubing. VTuber setups are about as diverse as the setups of regular Youtuber creators. The route you choose depends on your budget, content, and the kind of ‘look’ you want for your channel.

This guide covers:

  • The most common setups for non-animators who are new to VTubing
  • What different VTubing configuration cost 
  • What workflow you’ll need to create your avatars in different software
  • Free VTuber software
  • How VTubers achieve full-body motion capture
  • How to stream your avatar to various platforms in real-time

First, do you know what kind of VTuber you want to be? 

The first thing to consider is how you’re planning to build your VTuber setup: 

  1. As an independent creator: You’re in complete control of your avatar. You’re the pilot, operator, voice actor, and chat moderator! It can be especially stressful as you’re also live-streaming most of the time. However, you have total creative control and can create any world you can imagine. In some cases, creators band together into small indie teams and divide the workload.
  2. As an agency: Agency VTubers are almost like regular actors. The ‘VTuber’ is a digital character managed by a team of people while an actor performs the vocals and movements. Agencies will also manage the VTubers marketing, social media, etc.

Tip: If you’re an English VTuber who wants to get to the next level, take a look at the English-language agency VShojo — they provide the support, management, and outreach you need. 

Hololive creates and manages a large number of popular VTubers.

We’ll look exclusively at the kind of setup you need to be an independent creator. This in-depth guide should give you a good idea of what hardware to buy, what software to use, and how to start streaming. 

How much do I need to spend on my VTuber setup? 

The important thing to remember is that VTubing can be totally free or cost thousands of dollars. It depends on the quality and range of motion capture you want and your character development costs. If you’ve got a half-decent computer with a webcam, you’re already on the right track. It’s feasible to spend $0 and still stream an animated character with fairly good facial movement. However, if you want better animation fidelity and a full range of motion, you can expect to spend around $4000 - $15,000 on hardware and software.

The three levels of VTuber hardware.

There are three levels of motion capture animation that you can achieve as a VTuber: 

What you need to know when choosing a VTuber avatar

Some entry-level VTuber software will provide you with a large collection of free avatars to choose from. Some even allow you to customize your avatar with a few sliders. This is great when you’re just starting out, but if you want to break away from the standard designs, you’ll need to create your own model. 

It’s unlikely that you will find a VTuber model on asset marketplaces like Tubosquid, as most of those models are not rigged for motion capture and don’t have the correct blendshapes. You have three options: 

  • Use a sophisticated VTuber maker like Vroid Studio, Daz3D, MetaHuman Creator, or Character Creator 3. Keep reading for more info on these avatar creators. 
  • Have your character custom commissioned from a reputable 3D artist. There are plenty on Fiverr and other marketplaces. 
  • Create a character yourself in Autodesk Maya or a similar application.

Note: This guide won’t dive into custom character creation in programs like Maya because it’s a pretty advanced subject. It takes years to learn how to sculpt good 3D models. To create your own model in Maya or with an avatar builder, remember specific blendshapes are required for VTuber models as most facial motion capture solutions use Apple’s ARKit and TrueDepth camera. You’ll have to stick to their standard 56 blendshapes to make your character compatible with VTuber software. 

Free VTuber software that’s all-in-one

Once you’ve got an avatar, you need to import it into software that can retarget the motion capture data and render your character in real-time. The three most popular free tools that VTubers use to create content are VUP, Vtube Studio, and Amimaze by Facerig. All three are free to use and have paid versions. If you want to upgrade to higher quality facial motion capture, full-body performance capture, and maybe even finger capture, we recommend using Unreal Engine due to its powerful render engine and all-around stability. The software is still totally free to use; it just has a much steeper learning curve.

VUP 

VUP is a popular program for people making a VTuber model in an anime character style. It allows you to customize existing avatars, upload custom builds, and record facial mocap with your webcam or phone. It’s free to use, plus you can import motion capture data from additional tools like motion capture gloves. VUP also supports 2D VTubers using 2D Live models.

VUP’s interface is really easy to use.

Vtube Studio

VTube Studio is another software with plenty of anime-styled characters. However, it only deals with live2D models. 

Vtube Studio is a good choice if you prefer a 2D anime style.

Animaze

Animaze is free software that’s a great option for people in search of the easiest way to start a live broadcast today. You can create custom avatars, buy community models, and even upload your own characters from external software. You can also find many animal models or models from popular games. It’s free to use and can import motion capture data from a limited number of mocap tools. 

Animaze comes with a huge variety of premade avatars.

Full-Body VTuber Setup using Rokoko

Full body performance capture will record the motion of your entire body. It’s significantly more advanced than the all-in-one options highlighted above — but is the best choice if you want to output high-quality content. This setup assumes that you’re using fairly powerful computer equipment that exceeds the minimum requirements for running 3D applications. You’ll also need an iPhone X or higher at your disposal. You do NOT need a big studio space or a greenscreen as Rokoko works using inertial motion capture technology and not optical. 

Rokoko Creative Director Sam Lazarus runs the Rokoko Youtube channel from his home, and uses mocap in many tutorials.

 Here’s the hardware you need: 

For more in-depth setup details, check out this Youtube Series. Be aware that there are a few minor networking considerations when using an inertial motion capture solution. 

The software you need assuming you already have a 3D character and will be streaming with Unreal Engine:

  • Rokoko Face Capture or a similar solution for facial tracking with your iPhone (iPhone X or higher, as they have a TrueDepth Camera)
  • Rokoko Studio to capture all the mocap in real-time
  • Rokoko Unreal plugin to record or stream the mocap directly into your 3D scene in Unreal
  • Unreal Engine to create your scene and render it in real-time using a virtual camera
  • OBS or another screen recorder to capture your viewport and stream to twitch etc.  

So how do VTubers stream to Twitch or Youtube? 

Once you're piloting your digital avatar in Unreal Engine, it’s pretty easy to get it onto a stream. A virtual streamer follows the same process as a regular streamer — you broadcast your screen with a video overlay.

If you’ve ever used any kind of streaming software, then you probably know about Open Broadcaster Software or OBS for short. OBS is free, open-source software for video recording and live streaming that works by simply capturing your display. To overlay your character on a video or a game, you’ll need to set up a green screen in Unreal. You can use the tutorial below to see exactly how to achieve that effect. This tutorial will teach you how to use OBS and a green screen in Unreal Engine to isolate your 3D character:

Take your VTubing to the next level with full-body motion capture

Looking for more information on Rokoko’s motion capture solutions and how it works for VTubers? Book a demo here. 

Book a personal demonstration

Schedule a free personal Zoom demo with our team, we'll show you how our mocap tools work and answer all your questions.

Product Specialists Francesco and Paulina host Zoom demos from the Copenhagen office