Speech to Text converter in android

Speech to Text converter in android

Android has a very cool feature that still many developers don’t know. Apps like Any.DO uses speech to text conversion feature quite creatively. In today’s world of Siri, voice commands are of utmost importance. Android natively provides the feature of Speech to Text so why not to use it in our app!

I will show you how to use Android’s Speech to Text API in an application.
Let’s make our demo application.

           The App will be very simple. It will have a button with Mic symbol. On click of which we trigger Android’s Speech to Text Intent which shows a dialog to take speech input. The speech input is then converted into text. The text is then displayed in a text view.

Step 1: Create Basic Android Project in Eclipse

Create a Hello World Android project in Eclipse. Go to New > Project > Android Project. Give the project name as SpeechToTextDemo and select Android Runtime 2.1 or SDK 7. I have given package name net.viralpatel.android.speechtotextdemo.

Once you are done with above steps, you will have a basic hello world Android App.

Step 2: Change the Layout

For our demo, we need simple layout. Just one Image Button to trigger Speech to Text API and one TextView to display result text that is converted from speech.

Open layout/main.xml in your android project and replace its content with following:

File: res/layout/main.xml
<LinearLayout xmlns:android="http://schemas.android.com/apk/res/android"
    android:orientation="vertical" >
        android:src="@android:drawable/ic_btn_speak_now" />
        android:textAppearance="?android:attr/textAppearanceLarge" />

The UI is very simple. One LinearLayout to organize the button and text view. Note the id for the button btnSpeak and text view txtText which we will use in our Java code.

Step 3: Android Java Code to trigger Speech to Text API

Open SpeechToTextDemoActivity class and replace the code with following.

File: SpeechToTextDemoActivity.java

import java.util.ArrayList;

import android.app.Activity;
import android.content.ActivityNotFoundException;
import android.content.Intent;
import android.os.Bundle;
import android.speech.RecognizerIntent;
import android.view.Menu;
import android.view.View;
import android.widget.ImageButton;
import android.widget.TextView;
import android.widget.Toast;

public class MainActivity extends Activity {

protected static final int RESULT_SPEECH = 1;

private ImageButton btnSpeak;
private TextView txtText;

public void onCreate(Bundle savedInstanceState) {

txtText = (TextView) findViewById(R.id.txtText);

btnSpeak = (ImageButton) findViewById(R.id.btnSpeak);

btnSpeak.setOnClickListener(new View.OnClickListener() {

public void onClick(View v) {

Intent intent = new Intent(

intent.putExtra(RecognizerIntent.EXTRA_LANGUAGE_MODEL, "en-US");

try {
startActivityForResult(intent, RESULT_SPEECH);
} catch (ActivityNotFoundException a) {
Toast t = Toast.makeText(getApplicationContext(),
"Ops! Your device doesn't support Speech to Text",


public boolean onCreateOptionsMenu(Menu menu) {
getMenuInflater().inflate(R.menu.activity_main, menu);
return true;

protected void onActivityResult(int requestCode, int resultCode, Intent data) {
super.onActivityResult(requestCode, resultCode, data);

switch (requestCode) {
if (resultCode == RESULT_OK && null != data) {

ArrayList<String> text = data


                The heart of Speech to text Android API is package android.speech and specifically class android. speech.RecognizerIntent. Basically, we trigger an Intent (android.speech.RecognizerIntent) which shows the dialog box to recognize speech input.

This Activity then converts the speech into text and send backs the result to our calling Activity. When we invoke android.speech.RecognizerIntent intent, we must use startActivityForResult() as we must listen back for result text. Note how in above code we create intent android.speech.RecognizerIntent and trigger it.

Also we add one extra parameter using .putExtra() method. When invoking RecognizerIntent, we must provide extra RecognizerIntent.EXTRA_LANGUAGE_MODE. Here we are setting its value to en-US. Since we triggered the RecognizerIntent via startActivityForResult(), we override methodonActivityResult(int requestCode, int resultCode, Intent data) to handle the result data. The RecognizerIntent will convert the speech input to text and send back the result as ArrayList with key RecognizerIntent.EXTRA_RESULTS.

Generally, this list should be ordered in descending order of speech recognizer confidence. Only present when RESULT_OK is returned in an activity result. We just set the text that we got in result in text view txtText using txtText.setText(). One thing worth noting here is how to handle devices/android version that doesn’t support speech to text API.

In such case, exception ActivityNotFoundException will be thrown when we try to start the activity. In above example, we have caught this exception and displayed a message “Oops! Your device doesn’t support Speech to Text” using Toast.


Speech to Text converter

Speech to Text converter
Speech to Text converter

Speech to Text converter

Leave a Reply

Your email address will not be published. Required fields are marked *