Google's DeepMind have revealed a new speech synthesis generator that will be used to help computer voices, like Siri and Cortana, sound more human.
谷歌旗下的人工智能公司DeepMind近日研制出了一种新型语音合成系统, 该技术可以让如Siri和Cortana这样的计算机合成语音听起来更接近真实人声。
Named WaveNet, the model works with raw audio waveforms to make our robotic assistants sound, err, less robotic.
这项名为WaveNet的技术通过研究原始音频波形,使机器人助手的声音听起来不那么像机器人。
WaveNet doesn't control what the computer is saying, instead it uses AI to make it sound more like a person, adding breathing noises, emotion and different emphasis into senteneces.
WaveNet并不会控制计算机的说话内容,它只会应用人工智能技术在句子中添加呼吸声、情感和各种重音,从而使计算机语音听起来更像真人。
Generating speech with computers is called text-to-speech (TTS) and up until now has worked by piecing together short pre-recorded syllables and sound fragments to form words.
用计算机合成语音的技术叫做“从文本到语音(TTS)”,现存的工作原理是将提前录制好的短音节和声音碎片合成语言。
As the words are taken from a database of speech fragments, it's very difficult to modify the voice, so adding things like intonation and emphasis is almost impossible.
由于语言是从语音碎片数据库中提取出来的,声音很难修饰,所以几乎不可能添加声调和重音等因素。
This is why robotic voices often sound monotonous and decidedly different from humans.
这就是为什么机器人语音听起来很生硬,明显和人声不同。
WaveNet however overcomes this problem, by using its neural network models to build an audio signal from the ground up, one sample at a time.
然而WaveNet克服了这个难关,利用神经元网络模型从头建立一个音频信号,每次生成一个样本。
During training the DeepMind team gave WaveNet real waveforms recorded from human speakers to learn from.
培训期间,DeepMind团队让WaveNet学习了一些真实记录的人类语音波形。
Using a type of AI called a neural network, the program then learns from these, much in the same way a human brain does.
通过一种叫做神经元网络的人工智能技术,这个系统可以像人类的大脑一样对这些波形进行学习。
The result was that the WaveNet learned the characteristics of different voices, could make non-speech sounds, such as breathing and mouth movements, and say the same thing in different voices.
所以WaveNet学习了不同声音的特点,可以发出非语言声音,比如呼吸声和嘴部活动的声音,并且可以用不同的声音说同样的内容。
Despite the exciting advancement, the system still requires a huge amount of processing power, which means it will be a while before the technology appears in the likes of Siri.
虽然这个系统有激动人心的进步,但是它需要很强大的处理能力,这意味着这项技术并不能很快应用到Siri当中。
Google's machine learning unit DeepMind is based in the UK and have previously made headlines when their computer beat the Go World champion earlier this year.
Google旗下的机器学习技术企业DeepMind总部设在英国,今年早些时候,他们的计算机因打败了围棋世界冠军而上了头条。
北京市东城区2016高考英语阅读理解学生联合自选(2)
英语阅读中如何找出隐含的主旨
2016届高考英语考前语法讲解:高考常考的定语从句八大类
2016届高考英语阅读理解考前突破:财经资讯世界银行——全球经济转折点
2016届高考英语阅读理解考前突破:财经资讯香港连续20年全球经济自由度指数最高
2016届高考英语考前语法讲解:使用被动语态应受哪些限制
2016届高考英语阅读理解考前突破:财经资讯中国财政部公布2016预算计划
2016届高考英语阅读理解考前突破:财经资讯乌克兰政局动荡引起小麦价格上涨
2016届高考英语考前名师热点预测:3 形容词和副词
2016届高考英语考前语法讲解:如何考查定语从句
2016届高考英语考前语法讲解:情态动词后接完成式
北京市东城区2016高考英语阅读理解学生联合自选(12)
2016届高考英语阅读理解考前突破:财经资讯雅虎首席运营官意外离职
北京市东城区2016高考英语阅读理解学生联合自选(10)
2016届高考英语阅读理解考前突破:财经资讯腾讯京东联姻意在对抗阿里巴巴
2016届高考英语考前名师热点预测:2 代词
金球奖获奖感言不够“得体” 抖森发贴道歉
生活小常识:浴巾到底该多久洗一次?
北京市东城区2016高考英语阅读理解学生联合自选(8)
2016届高考英语考前语法讲解:冠词详解
金球奖上基情四射!死侍、蜘蛛侠深情接吻!
北京市东城区2016高考英语阅读理解学生联合自选(3)
双语美文阅读:生活的美好始于你能怀拥微笑
如果所有人都怎样世界就会好?
2016届高考英语阅读理解考前突破:财经资讯2016第四季度法国经济增长0.3%
2016届高考英语考前语法讲解:时态详解
北京市东城区2016高考英语阅读理解学生联合自选(1)
2016届高考英语考前语法讲解:宾语从句四注意
2016届高考英语阅读理解考前突破:财经资讯中国传统银行向互联网投资发起反击
北京市东城区2016高考英语阅读理解学生联合自选(6)
不限 |
英语教案 |
英语课件 |
英语试题 |
不限 |
不限 |
上册 |
下册 |
不限 |