Menu
What will it take to make AI sound human?

What will it take to make AI sound human?

'It's a matter of being personalized,' says CMU professor Alan Black

Pepper the robot appears on stage with a Softbank executive at CES in Las Vegas on Jan. 7, 2016 Credit: James Niccolai

Pepper the robot appears on stage with a Softbank executive at CES in Las Vegas on Jan. 7, 2016 Credit: James Niccolai

Conversation fillers such as "hmm" and "uh-huh" may seem like insignificant parts of human conversation, but they're critical to improving communication between humans and artificial intelligence.

So argues Alan Black, a professor in the Language Technologies Institute at the Carnegie Mellon School of Computer Science, who specializes in speech synthesis and ways to make artificially intelligent speech sound more real.

Both Siri and Cortana incorporate aspects of Black's work, he says. But for the most part, such technologies still boil down to a pretty simple pattern: The human speaks, then the machine processes that speech and answers.

"It's not really how humans interact," Black said in an interview on Friday. "It's a stilted kind of interaction."

Key to making such conversations more natural are pauses, fillers, laughs and the ability of speakers to anticipate and complete each other's sentences -- all of which help build rapport and trust.

"Laughing is part of communication," he said. "Machines don't do that -- if they did, it would be unbelievably creepy -- but ultimately they should."

Black and his students are working on those areas.

"You need mm-hmm, back channels, hesitations and fillers, and so far our speech synthesizers can't do that," Black said. "If a system does say 'uh-huh,' it sounds like a robot."

Technologies using synthetic voices typically use speech recorded by humans "in a little room reading sentences," he explained. That, in turn, is "why they sound bored."

Working with students, Black is experimenting with using voices recorded in dialog, so that even if you just capture and use one side, it's clear the speakers are engaged. The idea is to model and incorporate the variance in human responses rather than using the same response all the time -- otherwise, humans can tell it's fake, Black said.

Ultimately, good AI will also know your views on certain topics, such as which candidate you support or oppose in a political race, so it won't say something offensive.

"On a higher level, it's a matter of being personalized," Black said. "That can be creepy, but it can also be appropriate, and it's important for trust. It's all about building this thing that's close to what humans expect and makes it easier to have this conversation."

Looking ahead, another big issue is how to get people to learn to do new things with their devices. There's basic interaction happening now with technologies like Siri and Cortana, but the next challenge is to get users to turn to AI first for answers, Black said.

Some users have been embarrassed talking to their phones but more comfortable talking to Amazon Echo because all they have to do is speak out loud in their homes. "People are treating it differently," he said. "It's there in the room with you."

Follow Us

Join the New Zealand Reseller News newsletter!

Error: Please check your email address.

Featured

Slideshows

Examining the changing job scene in the Kiwi channel

Examining the changing job scene in the Kiwi channel

Typically, the New Year brings new opportunities for personnel within the Kiwi channel. 2017 started no differently, with a host of appointments, departures and reshuffles across vendor, distributor and reseller businesses. As a result, the job scene across New Zealand has changed - here’s a run down of who is working where in the year ahead…

Examining the changing job scene in the Kiwi channel
​What are the top 10 tech trends for New Zealand in 2017?

​What are the top 10 tech trends for New Zealand in 2017?

Digital Transformation (DX) has been a critical topic for business over the last few years and IDC is now predicting a step change as DX reaches macroeconomic levels. By 2020 a DX economy will emerge and it will become the core of what New Zealand industries focus on. From the board level through to the C-Suite, Kiwi organisations must be prepared to think and act digital when the DX economy emerges in 2017.

​What are the top 10 tech trends for New Zealand in 2017?
Top 15 Kiwi tech storylines to follow in 2017

Top 15 Kiwi tech storylines to follow in 2017

​The New Year brings the usual new round of humdrum technology predictions, glaringly general, unashamedly safe and perpetually predictable. But while the industry no longer sees value in “cloud is now the norm” type projections, value can be found in following developments of the year previous, analysing behaviours and patterns to formulate a plan for the 12 months ahead. Consequently, here’s the top Kiwi tech storylines to follow in 2017...

Top 15 Kiwi tech storylines to follow in 2017
Show Comments