How to Make Subtitles Automatically Using Speech to Text

By James T Wood

Voice recognition software typically does a poor job of converting speech to text when a video file contains multiple voices. However, if the video has only one clear voice, you may be able to use voice recognition software to create captions for it automatically. Reducing sound interference gives you the best chance of an accurate speech-to-text conversion, although you still need to proofread the subtitles at the end of the process to catch any egregious errors.

Step 1

Plug one end of a 3.5 mm audio cable into the headphone jack on your computer, then plug the other end into the microphone jack on the same computer. This feeds the speaker output directly into the microphone input, so the speech-to-text application does not pick up ambient noise.

Step 2

Launch your video player and open the video file you want to transcribe. Cue up the video to the point where the speaking starts. If you need to hear the video while you cue it up, unplug the audio cable from the headphone jack on your computer.

Step 3

Launch your speech-to-text application and a text editing program. Start the speech recognition engine. Plug the audio cable back in if you unplugged it in Step 2.

Step 4

Start playing the video, then click in the text editing program so that the voice recognition software inserts the recognized text into the open file.

Step 5

Stop the voice recognition software when the video is done playing.

Step 6

Proofread the text to ensure that the speech-to-text conversion was accurate. Correct any mistakes, then save the text file and close the text editor.
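
Speech-to-text output often arrives as one long run of text, while captions are shown a line or two at a time. As an optional aid, the proofread text could be broken into caption-sized lines before it goes into the caption tool. The Python sketch below is only an illustration: the function name split_into_captions and the 42-character default are assumptions, not something your speech-to-text application or video editor requires.

```python
import textwrap


def split_into_captions(transcript: str, max_chars: int = 42) -> list[str]:
    """Split proofread transcript text into short caption lines.

    max_chars is an assumed limit; adjust it to whatever reads
    comfortably on screen in your video.
    """
    # Collapse line breaks and repeated spaces so wrapping is predictable.
    normalized = " ".join(transcript.split())
    # textwrap.wrap breaks only at whitespace (unless a single word is
    # longer than the limit) and returns a list of lines.
    return textwrap.wrap(normalized, width=max_chars)


if __name__ == "__main__":
    sample = "Reducing the sound interference gives you the best chance for good results."
    for line in split_into_captions(sample):
        print(line)
```

Each returned line can then be pasted or imported as one caption, although you may still want to move the breaks by hand so they fall at natural pauses in the speech.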

Step 7

Close the video playback program, launch your video editing software and use its caption tool to open the text file you created. Fine-tune the subtitle placement within the video file so that the words on the screen match the words being spoken in the audio track.
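
Some video editors can import timed captions from a SubRip (.srt) file instead of plain text. Whether yours does is an assumption you need to check in its documentation. If it does, the timing can be sketched as follows: each caption gets a sequence number, a start and end time written as hours:minutes:seconds,milliseconds, and its text. The function names format_timestamp and build_srt, and the sample times, are hypothetical and only illustrate the layout.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SubRip form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(cues: list[tuple[float, float, str]]) -> str:
    """Build SubRip text from (start_seconds, end_seconds, caption) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        timing = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    # A blank line separates one caption block from the next.
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical timings; take the real ones from your video.
    cues = [
        (0.0, 2.5, "Welcome to the video."),
        (2.5, 5.0, "Let's get started."),
    ]
    print(build_srt(cues))
```

Adjusting the start and end times in the list of cues is the text-file equivalent of dragging captions along the timeline in the editor.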

Step 8

Save the video file with the subtitles.