Until April, Microsoft boasted of having the largest collection of faces that anyone could use to train facial-recognition algorithms. Since then, the once publicly-available dataset has quietly disappeared.
直到四月,微软都吹嘘拥有最大的人脸数据库,任何人都可以使用它来训练面部识别算法。而那之后,曾经公开可用的数据集已经悄然消失。
As the Financial Times reports, Microsoft quietly deleted the dataset after the paper called attention to privacy and ethical issues, including use of the dataset by military researcherss.
正如英国《金融时报》报道的那样,在该报引发了关于隐私和道德问题的关注之后(包括军事研究人员和中国监管公司使用数据集),微软悄然删除了数据集。
Microsoft did not immediately respond to a request for comment from Fortune. But it told the Financial Times: “The site was intended for academic purposes. It was run by an employee that is no longer with Microsoft and has since been removed.”
微软没有立即回复《财富》杂志的评论请求。但它告诉英国《金融时报》:“该网站是为了学术目的设立的。它由一名不再受雇于微软的员工运营,并且已经被删除。”
The now-deleted dataset contained more than 10 million faces culled from websites like Flickr, which host photographs uploaded under a Creative Commons license—meaning many can be used free of copyright concerns.
现已删除的数据集中包含超过1000万张面孔,这些面孔来自Flickr等网站,这些网站储存的是根据知识共享许可上传的照片——这意味着许多都可以免费,但可能有版权问题。
The name of the Microsoft dataset, MS Celeb, was chosen because many of the images it contains are famous people who live public lives. Many of the other faces in the set, however, belong to people who are not celebrities—including journalists and privacy researchers—and who were not aware their images had been included.
这个微软的数据集叫MS Celeb,之所以选择这个名称,是因为它包含的许多图像都是过着公开生活的名人。然而,该集中的许多其他面孔属于不是名人的人——包括记者和隐私研究人员——并且他们不知道他们的图像被包括在内。
Microsoft is hardly the only company to assemble large datasets by scraping photos from the open Internet. In January, IBM announced it was sharing a collection of 1 million faces in the name of promoting more diversity in artificial intelligence. Meanwhile, a website called Megapixels identifies several other massive collections as part of a bid to halt what it describes as a “growing crisis of authoritarian biometric surveillance.”
微软并不是唯一一家通过从开放的互联网上抓取照片来组装大型数据集的公司。今年1月,IBM宣布它正在以促进人工智能更多样化的名义共享100万张面孔。与此同时,一个名为Megapixels的网站确定了另外几个大型集合,以此来阻止它所谓的“威胁性的生物识别监视危机”。
While many of the facial recognition sets are culled from public websites like Flickr, that is not the only way companies obtain pictures of faces. As a recent Fortune investigation revealed, startups have been using photo collection apps to surreptitiously collect millions of faces, while other companies have been scanning public collections of mug shots.
虽然像Flickr这样的公共网站很多都剔除了面部识别装置,但这并不是公司获取面部图片的唯一方式。最近《财富》调查显示,创业公司一直在使用照片收集应用程序暗中收集数百万张面孔,而其他公司则一直在扫描大量的大头照。
上一篇: 美国人觉得英国哪些方面很奇怪
下一篇: 只是学了几句手语,现在我哭成了泪人儿!
春天来了:不花钱也能乐享春天
丹麦发现700年前厕所 现在依然特别臭!
年轻的求职者都会犯的10个错
美议员批马方贻误时机 搜寻不力
国际英语资讯:Explosions occur at ammunition depot in western Iraq
国内英语资讯:Chinas border region to promote AI cooperation with ASEAN
体坛英语资讯:Bolivia dump coach Villegas after Copa America flop
国际英语资讯:Africa supports goals of comprehensive nuke test ban treaty: envoy
为何好老板经常不开心
The Amazing School Life 美好的学校生活
体坛英语资讯:Chinese Football Association to elect new president next Thursday
成功人士睡前必做的9件事
国际英语资讯:Roundup: Italys PM Conte unveils new govt program to lower house, winning first confiden
40万英镑高薪美差:去西伯利亚数北极熊
马航事故削弱中国人赴马旅行意愿
伦敦市长头发凌乱 小朋友赠梳子请他多梳头
高蛋白饮食和寿命长短有关系吗
视频:萌童得知又添一妹妹嚎啕大哭
国内英语资讯:Xi stresses synergy, coordination, efficiency in advancing reform
机会来了你能抓住么?拿出你的最佳表现!
手机支付也麻烦?亚马逊将推“徒手”支付
体坛英语资讯:Barty into semi, Osaka injured at Cincinnati Masters
国内英语资讯:Interview: CPC gains intl respect with its achievements, Jordanian party leader says
天下之大无奇不有:盘点各种奇怪的工作
看诺贝尔经济学奖得主如何谈投资
中山装潮流强势回归
澳洲房价上涨是中国炒房团惹的祸?
揭秘导致加速衰老的5种原因
习主席里昂晚宴菜单曝光:欧洲王室最高规格
国内英语资讯:China to amend laws to further empower local legislatures