Media & Entertainment

Where is voice tech going?

Comment

Image Credits: Luis Alvarez (opens in a new window) / Getty Images

Mark Persaud

Contributor

Mark Persaud is digital product manager and practice lead at Moonshot by Pactera, a digital innovation company that leads global clients through the next era of digital products with a heavy emphasis on artificial intelligence, data and continuous software delivery.

2020 has been all but normal. For businesses and brands. For innovation. For people.

The trajectory of business growth strategies, travel plans and lives have been drastically altered due to the COVID-19 pandemic, a global economic downturn with supply chain and market issues, and a fight for equality in the Black Lives Matter movement — amongst all that complicated lives and businesses already.

One of the biggest stories in emerging technology is the growth of different types of voice assistants:

  • Niche assistants such as Aider that provide back-office support.
  • Branded in-house assistants such as those offered by BBC and Snapchat.
  • White-label solutions such as Houndify that provide lots of capabilities and configurable tool sets.

With so many assistants proliferating globally, voice will become a commodity like a website or an app. And that’s not a bad thing — at least in the name of progress. It will soon (read: over the next couple years) become table stakes for a business to have voice as an interaction channel for a lovable experience that users expect. Consider that feeling you get when you realize a business doesn’t have a website: It makes you question its validity and reputation for quality. Voice isn’t quite there yet, but it’s moving in that direction.

Voice assistant adoption and usage are still on the rise

Adoption of any new technology is key. A key inhibitor of technology is often distribution, but this has not been the case with voice. Apple, Google, and Baidu have reported hundreds of millions of devices using voice, and Amazon has 200 million users. Amazon has a slightly more difficult job since they’re not in the smartphone market, which allows for greater voice assistant distribution for Apple and Google.

Image Credits: Mark Persaud

But are people using devices? Google said recently there are 500 million monthly active users of Google Assistant. Not far behind are active Apple users with 375 million. Large numbers of people are using voice assistants, not just owning them. That’s a sign of technology gaining momentum — the technology is at a price point and within digital and personal ecosystems that make it right for user adoption. The pandemic has only exacerbated the use as Edison reported between March and April — a peak time for sheltering in place across the U.S.

Image Credits: Mark Persaud

When we look at the adoption cycle, voice is evolving in different stages. Measured by monthly active users, we are still in early stages of voice’s overall adoption lifecycle with devices such as smartwatches. But use of smartphones has penetrated half the U.S. population. Voice search is mature, with two-thirds of the U.S. population using it because they’re comfortable with it. As with most technologies, change happens unevenly. “Voice first” doesn’t mean everyone is using voice the same way, rather in a breadth of ways, which speaks to its applicability across contexts.

Voice is global

It’s all too easy to think of voice just in the context of the U.S. market, but voice is a global phenomenon. China accounts for 30%-40% of smart speaker sales, and the rate of total installed base is catching up. Albeit the digital context for using voice is different in China, it’s usually tied to a super app’s ecosystem.

Regional differences become even more striking when you examine the different assistants catching on globally. The big voice assistants such as Alexa, Cortana, Google Assistant and Siri do not speak for the world.

Image Credits: Mark Persaud

This is a global technology adoption and consumer behavior movement, which makes it exceedingly exciting to be involved with and continue to explore for businesses around the world.

Voice design and sonic branding are becoming more prevalent

With all these (perhaps commoditized) voice experiences, remember that value gets created from the experience and relationship established with users. Voice design and voice user interface (VUI) creation still greatly matter, and will continue to grow in importance. It’s far too easy to create poor voice experiences — unfortunately the public has seen many, many poor Alexa skills or Google Actions that leave you in a voice interaction loop or an inability to course correct. A poor voice user experience is frustrating for users and more harmful to a brand than a bad text-based website interaction.

That’s because a voice-based experience is less forgiving. With a poorly designed VUI, the user lacks a way to decipher the content or information further. User comments like “Where do I go from here?”, “That’s not what I asked” and “I’m not sure what to do with that information” are statements that VUI designers do not want to hear. This is, of course, provided that the user was understood by the automated speech recognition (ASR) and natural language understanding (NLU), and received a response from the voice application.

All of this decreases the user’s trust in the medium and pushes them back to, say, websites or phone calls. As a result, the bad brand experience might result in the user not wanting to interact with the brand via the voice interface again, which will be a major setback when competitors are thriving in the space and voice commerce becomes more prevalent. It’s tempting and easy for users to try voice and say, “I like the old way better” because the old way is more reliable, or they know how to navigate it. That’s the common issue with the new and change altogether.

The uptake of voice assistants reminds me of the adoption of websites into mainstream society. Websites weren’t always as helpful or as beautiful as they are today. While many factors influenced the proliferation of websites (the internet, internet speed, browser compatibility, mobile versions, etc.), it all started with content sharing and simple functionality. Over time, websites have evolved into aesthetically beautiful, eye-luring, easily navigable media.

Voice will be no different, having started with a very wide breadth of voice experiences and homing in on what works and what doesn’t for the users and brands they serve, to adding contextual relevancy for where they’re being used, and last to adding personality and sonic branding.

Some brands (McDonald’s and CBS to name a few) have adopted a jingle or sonic brand. When you hear their familiar notes, you think of the brands. Those moments of familiarity pay off years of effort and user training with the voice medium.

Additionally, consider brands that have a strong brand personality such as Slim Jim, Headspace and Airbnb that are utilized to create voice-based experiences with personalities to complement their visual identities. This comes to life when brand voice experience considers tone, timbre, intonation and lexicon. Literally being able to exude the brand voice straight to a user’s ears. This will push the brand-user relationship to be even stronger (perhaps even reestablishing loyalty in newer generations), when done correctly.

Addressing 2020 head-on with voice

Contactless (commercial, public, retail) interactions

As brands address the health and safety concerns of consumers to restart their businesses, contactless interactions rise to the top. Removing (or minimizing) the physical touchpoints of a business is making people think digital-first in a quick, prioritized way as, for many businesses, their livelihood depends on it in a way not felt before. Businesses are adapting their mindset from “when I have time for digital’ to “digital has to happen now.”

Using voice-enabled applications has now become a part of that transformation — to do everything from browsing, getting information and navigating to ordering products and checking out. From a personal health standpoint, using our voices is less risky behavior than an interface that requires touching a user-shared screen or paying with and receiving unsanitized cash (activities that usually require you to be within six feet of others, especially strangers). The airport and restaurant industries will likely be the first to address these issues as they’ve been hit hard with today’s pandemic and the recessionary economy.

Assisting at-home education

In the spring of 2020, many parents everywhere suddenly became de facto home schoolers as schools shut down and kids were sent home. This unbelievably stressful burden may continue into the fall. The situation is untenable. A recently published New York Times article says it all: “In the Covid-19 Economy, You Can Have a Kid or a Job. You Can’t Have Both.

Voice is attempting to provide some relief. Google showed us one example. Earlier in 2020, Google launched a new voice assistant that helps parents who are home-schooling their kids. Titled Diya, the assistant is designed to teach children how to read. Diya uses stories and word games to help kids five and up. Diya uses Google’s speech recognition technology to spot mistakes and areas that are challenging kids. I imagine there are more ways voice can and will help parents as they attempt to manage the demands of working and home-schooling.

Empowering physical and mental health

As people sought ways to understand the health threat created by COVID-19, the Mayo Clinic introduced an Alexa skill for people to get answers to questions about COVID-19. This was an important example of how voice could contribute to the well-being of others while simplifying access.

Of course, the pandemic has created unprecedented levels of stress as people manage the health threat of an unchecked pandemic, forced isolation, and the threat of job loss and economic instability. People are struggling to cope. I see a meaningful opportunity for voice to help people manage mental health. For example, MoonPie created a virtual roommate that entertains people stuck at home in isolation — a whimsical example, to be sure, but in 2020, entertainment has taken on a more meaningful role.

Meanwhile, meditation app Headspace provides a voice-based interface to make it easier to meditate with a voice command. That kind of a tool could be a lifesaver for anyone who counts themselves among the surging numbers of people fighting mental exhaustion and stress.

Sharing workplace culture at home

The future of the workplace remains uncertain. Some companies are slowly opening their brick-and-mortar locations and offices. Others are not. Twitter famously told employees they can work at home indefinitely. This dramatic change in how we work creates new challenges, such as maintaining a sense of culture when people are not in the same place.

For example, using voice to share customized messages amongst colleagues, or using random voice Easter eggs to mimic someone stopping by your desk to share an inside joke. We miss our colleagues and their ad hoc banter, their interesting insights and their supportive attitudes (the terms “work-wife” or “work-husband” exist for a reason). Voice can help people make life apart have more lovable teammate moments and reinvigorate the culture we’re missing.

Supporting social awareness (and justice)

In the wake of the global social equality unrest that erupted around the world, Amazon, Apple and Google made some important changes to Alexa, Siri and Google Assistant. As a number of news outlets reported, if you ask Google Assistant whether Black lives matter, Google Assistant began providing more thoughtful replies, such as, “Black people deserve the same freedoms afforded to everyone in this country, and recognizing the injustice they face is the first step towards fixing it.” If you asked whether “all lives matter,” Google Assistant replies, “Saying ‘Black Lives Matter’ doesn’t mean that all lives don’t. It means Black lives are at risk in ways that others are not.” Both Alexa and Siri respond with similarly sensitive, nuanced answers instead of “of course,” or “I don’t understand your question.”

Enterprises might do well to listen to ideas bubbling up at a grassroots level. I recently read about a Reddit user who developed a Siri shortcut that makes it possible for someone when being pulled over by the police, to say, “Hey, Siri, I’m getting pulled over” — which results in Siri sending your current location to a designated person and automatically starts recording a video.

How might businesses go beyond using voice to make us more aware of Black Lives Matter to actually helping protect social justice and civic responsibility?

What does this all mean

The possibilities for voice are ever expanding — getting smarter, more personalized, in more contexts, assisting with broader messaging — especially in how it fits into a brand’s digital ecosystem, and more importantly the consumer’s ecosystem. Start investigating your voice ideas by running a voice design sprint. It’s a new world, and voice technology is shaping it.

More TechCrunch

Struggling EV startup Fisker has laid off hundreds of employees in a bid to stay alive, as it continues to search for funding, a buyout or prepare for bankruptcy. Workers…

Fisker cuts hundreds of workers in bid to keep EV startup alive

Chinese EV manufacturers face a new challenge in their pursuit of U.S. customers: a new House bill that would limit or ban the introduction of their connected vehicles. The bill,…

Chinese EV makers, and their connected vehicles, targeted by new House bill

With the release of iOS 18 later this year, Apple may again borrow ideas third-party apps. This time it’s Arc that could be among those affected.

Is Apple planning to ‘sherlock’ Arc?

TechCrunch Disrupt 2024 will be in San Francisco on October 28–30, and we’re already excited! This is the startup world’s main event, and it’s where you’ll find the knowledge, tools…

Meet Visa, Mercury, Artisan, Golub Capital and more at TC Disrupt 2024

Featured Article

The women in AI making a difference

As a part of a multi-part series, TechCrunch is highlighting women innovators — from academics to policymakers —in the field of AI.

3 hours ago
The women in AI making a difference

Ifeel is being offered as part of an employer’s or insurance provider’s healthcare coverage.

Mental health insurance platform ifeel raises a $20 million Series B

Instead of opening the user’s actual browser or a WebView, Custom Tabs let users remain in their app while browsing.

Google Chrome becomes a ‘picture-in-picture’ app

Sanil Chawla remembers the meetings he had with countless artists in college. Those creatives were looking for one thing: sustainable economic infrastructure that could help them scale rather than drown…

Slingshot raises $2.2 million to provide financial services to artists

A startup called Firefly that’s tackling the thorny and growing issue of cloud asset management with an “infrastructure as code” solution has raised $23 million in funding. That comes on…

Firefly forges on after co-founder murdered by Hamas

Mistral, the French AI startup backed by Microsoft and valued at $6 billion, has released its first generative AI model for coding, dubbed Codestral. Like other code-generating models, Codestral is…

Mistral releases Codestral, its first generative AI model for code

Pinterest announced today that it is evolving its Creator Inclusion Fund to now be called the Pinterest Inclusion Fund. Pinterest teamed up with Shopify’s Build Black and Build Native programs…

Pinterest expands its Creator Fund to allow founders

Cadillac may seem a bit too traditional to hang its driving cap on EVs. And yet, that hasn’t stopped the GM brand from rolling out — or at least showing…

Cadillac’s new Optiq EV is designed to hook young hipsters

Alex Taub, a longtime founder with multiple exits under his belt, believes it’s time to disrupt the meme industry. “I have this big thesis that meme tech is going to…

This founder says meme tech is the next big thing

Lux, the startup behind popular pro photography app Halide and others, is venturing into video with its latest app launch. On Wednesday, the company announced Kino, a new video capture app…

Kino is a new iPhone app for videographers from the makers of Halide

DevOps startup Harness has shown itself to be an ambitious company, building a broad platform of services while also dabbling in M&A when it made sense to fill in functionality.…

Harness snags Split.io as it goes all in on feature flags and experiments

Microsoft’s Copilot, a generative AI-powered tool that can generate text as well as answer specific questions, is now available as an in-app chatbot on Telegram, the instant messaging app.  Currently…

Microsoft’s Copilot is now on Telegram

HBO’s new documentary, “MoviePass, MovieCrash,” tells a story that many of us know about: how MoviePass, the subscription-based movie ticketing startup, was a catastrophic failure. After a series of mishaps…

MoviePass co-founders speak their truth in HBO’s new documentary 

The watch features a variety of different 3D games, unlocking more play time the more kids move.

Fitbit’s new kid smartwatch is a little Wiimote, a little Tamagotchi

In the video, a crowd is roaring at a packed summer music festival. As a beat starts playing over the speakers, the performer finally walks onstage: It’s the Joker. Clad…

Discord has become an unlikely center for the generative AI boom

After the Wirecard scandal, Germany’s financial regulator BaFin started to look more closely at young fintech startups that wanted to grow at a rapid pace — it’s better to be…

Germany’s financial regulator ends anti-money laundering cap on N26 signups after $10M fine

Among other things, this includes the ability to trace code from source to binary packages across both platforms, single sign-on support and unified project structures.

JFrog and GitHub team up to closely integrate their source code and binary platforms

The company’s public fund disbursement and e-commerce platform makes accepting school tuition and enabling educational enrichment more accessible. 

Tech startup Odyssey goes on journey to help states implement school choice programs

A new startup called Kinnect aims to help people privately save generational memories, traditions, recipes and more. The company’s app, launched this month, lets people create invite-only spaces where they…

Kinnect’s new app aims to help families record and store generational memories

Spotify has hiked its premium subscription in France by an eye-watering €0.13, in response to a new music-streaming tax.

Spotify hikes subscription price in France by 1.2% to match new music-streaming tax

The European Union has taken the wraps off the structure of the new AI Office, the ecosystem-building and oversight body that’s being established under the bloc’s AI Act. The risk-based…

With the EU AI Act incoming this summer, the bloc lays out its plan for AI governance

Solutions by Text, a company that gives people a way to pay their bills and apply for loans via text messaging, has secured $110 million in new growth funding. Edison…

Bootstrapped for over a decade, this Dallas company just secured $110M to help people pay bills by text

Owners of small- and medium-sized businesses check their bank balances daily to make financial decisions. But it’s entrepreneur Yoseph West’s assertion that there’s typically information and functions missing from bank…

Relay raises $32.2 million to help smaller businesses manage their cash flow

When other firms were investing and raising eye-popping sums, Clean Energy Ventures took a different approach. It appears to be paying off.

How Clean Energy Ventures avoided the pandemic bubble and raised a $305M fund

PwC, the management consulting giant, will become OpenAI’s biggest customer to date, covering 100,000 users.

OpenAI signs 100K PwC workers to ChatGPT’s enterprise tier as PwC becomes its first resale partner

Tech enthusiasts and entrepreneurs, the clock is ticking! With just 72 hours remaining until the early-bird ticket deadline for TechCrunch Disrupt 2024, now is the time to secure your spot…

72 hours left of the Disrupt early-bird sale