• Home
  • News
  • Coins2Day 500
  • Tech
  • Finance
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechAI

Google improves voice search (again) for its mobile apps

By
Derrick Harris
Derrick Harris
By
Derrick Harris
Derrick Harris
September 24, 2015, 3:00 PM ET
Facebook And Other Apps For iPhone And HTC Mobile Handsets
The Google Inc. company logo is seen on an Apple Inc. iPhone 4 smartphone in this arranged photograph in London, U.K., on Wednesday, Aug. 29, 2012. Apple Inc. is seeking a U.S. sales ban on eight models of Samsung Electronics Co. smartphones and the extension of a preliminary ban on a tablet computer after winning a patent trial against the South Korean company. Photographer: Chris Ratcliffe/Bloomberg via Getty ImagesChris Ratcliffe — Bloomberg via Getty Images

Google (GOOG) is claiming better voice search on its Android and iOS mobile apps, thanks to a new approach to the artificial intelligence technique the company uses to power that capability. A blog post published on Thursday, authored by a handful of Google researchers, explains in technical detail how they pulled off the improvements, which include faster, more-accurate transcriptions and better voice recognition in noisy places.

The boiled-down version is that Google switched its voice search system from one type of deep learning technique to another. In the old model, the system would analyze 10-millisecond snippets of audio and make predictions of words based on the sounds it recognized, regardless of the order in which they were uttered. The new model has a better memory, meaning it can consume larger snippets of audio and concern itself with the order in which particular sounds were spoken.

Here’s a more-technical, but illustrative explanation from the Google post:

If the user speaks the word “museum” for example—/m j u z i @ m/ in phonetic notation—it may be hard to tell where the /j/ sound ends and where the /u/ starts, but in truth the recognizer doesn’t care where exactly that transition happens: All it cares about is that these sounds were spoken.

Our improved acoustic models rely on Recurrent Neural Networks (RNN). RNNs have feedback loops in their topology, allowing them to model temporal dependencies: when the user speaks /u/ in the previous example, their articulatory apparatus is coming from a /j/ sound and from an /m/ sound before. Try saying it out loud – “museum” – it flows very naturally in one breath, and RNNs can capture that.

Google’s voice recognition team also added ambient noise and reverb to the the data it used to train its new system, meaning it does a better job understanding users trying to talk to their phones while in noisy places.

It’s all very complicated stuff from a computer science perspective, but is increasingly important to our everyday lives as we expect everything from our phones to our cars to be more intelligent. The techniques Google uses to power voice search in Android are related to what Apple (AAPL) is doing with Siri, what Microsoft (MSFT) with its Cortana digital assistant and what Amazon (AMZN) is doing with its various voice-controlled devices. They’re also related to techniques that allow software to recognize objects, faces and even our body movements.

If you want to learn more about how deep learning, the umbrella term for this collection of techniques, works, read Coins2Day‘s recent interview with Andrew Ng, the chief scientist at Chinese search engine giant Baidu (BIDU)and a renowned expert in the space.

To learn more about machine learning, watch this Coins2Day video:

Sign up for Data Sheet, Coins2Day’s daily newsletter about the business of technology.

About the Author
By Derrick Harris
See full bioRight Arrow Button Icon
Rankings
  • 100 Best Companies
  • Coins2Day 500
  • Global 500
  • Coins2Day 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Leadership
  • Success
  • Tech
  • Asia
  • Europe
  • Environment
  • Coins2Day Crypto
  • Health
  • Retail
  • Lifestyle
  • Politics
  • Newsletters
  • Magazine
  • Features
  • Commentary
  • Mpw
  • CEO Initiative
  • Conferences
  • Personal Finance
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Coins2Day Brand Studio
  • Coins2Day Analytics
  • Coins2Day Conferences
  • Business Development
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Coins2Day
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map

© 2025 Coins2Day Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Coins2Day Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.