Since the start of the year, a team of researchers at Carnegie Mellon University supported by grants from the Defense Advanced Research Projects Agency and Google, and tapping into a research supercomputing cluster provided by Yahoo has been fine-tuning a computer system that is trying to master semantics by learning more like a human. Its beating hardware heart is a sleek, silver-gray computer calculating 24 hours a day, seven days a week that resides in a basement computer center at the university, in Pittsburgh. The computer was primed by the researchers with some basic knowledge in various categories and set loose on the Web with a mission to teach itself.
For all the advances in computer science, we still dont have a computer that can learn as humans do, cumulatively, over the long term, said the teams leader, Tom M. Mitchell, a computer scientist and chairman of the machine learning department.
The Never-Ending Language Learning system, or NELL, has made an impressive showing so far. NELL scans hundreds of millions of Web pages for text patterns that it uses to learn facts, 390,000 to date, with an estimated accuracy of 87 percent. These facts are grouped into semantic categories cities, companies, sports teams, actors, universities, plants and 274 others. The category facts are things like San Francisco is a city and sunflower is a plant.
NELL also learns facts that are relations between members of two categories. For example, Peyton Manning is a football player . The Indianapolis Colts is a football team . By scanning text patterns, NELL can infer with a high probability that Peyton Manning plays for the Indianapolis Colts even if it has never read that Mr. Manning plays for the Colts. Plays for is a relation, and there are 280 kinds of relations. The number of categories and relations has more than doubled since earlier this year, and will steadily expand.
The learned facts are continuously added to NELLs growing database, which the researchers call a knowledge base. A larger pool of facts, Dr. Mitchell says, will help refine NELLs learning algorithms so that it finds facts on the Web more accurately and more efficiently over time.
NELL is one project in a widening field of research and investment aimed at enabling computers to better understand the meaning of language. Many of these efforts tap the Web as a rich trove of text to assemble structured ontologies formal descriptions of concepts and relationships to help computers mimic human understanding. The ideal has been discussed for years, and more than a decade ago Sir Tim Berners-Lee, who invented the underlying software for the World Wide Web, sketched his vision of a semantic Web.
Today, ever-faster computers, an explosion of Web data and improved software techniques are opening the door to rapid progress. Scientists at universities, government labs, Google, Microsoft, I.B.M. and elsewhere are pursuing breakthroughs, along somewhat different paths.
For example, I.B.M.s question answering machine, Watson, shows remarkable semantic understanding in fields like history, literature and sports as it plays the quiz show Jeopardy! Google Squared, a research project at the Internet search giant, demonstrates ample grasp of semantic categories as it finds and presents information from around the Web on search topics like U.S. presidents and cheeses.
Still, artificial intelligence experts agree that the Carnegie Mellon approach is innovative. Many semantic learning systems, they note, are more passive learners, largely hand-crafted by human programmers, while NELL is highly automated. Whats exciting and significant about it is the continuous learning, as if NELL is exercising curiosity on its own, with little human help, said Oren Etzioni, a computer scientist at the University of Washington, who leads a project called TextRunner, which reads the Web to extract facts.
Computers that understand language, experts say, promise a big payoff someday. The potential applications range from smarter search to virtual personal assistants that can reply to questions in specific disciplines or activities like health, education, travel and shopping.
The technology is really maturing, and will increasingly be used to gain understanding, said Alfred Spector, vice president of research for Google. Were on the verge now in this semantic world.
With NELL, the researchers built a base of knowledge, seeding each kind of category or relation with 10 to 15 examples that are true. In the category for emotions, for example: Anger is an emotion. Bliss is an emotion. And about a dozen more.
牛津实用英语语法 74指人的限定性关系从句
牛津实用英语语法:120 had better+不带to的不定式
牛津实用英语语法:127 may用来表示许可
牛津实用英语语法:119 have+宾语+过去分词
牛津实用英语语法:99 介词/副词
牛津实用英语语法:106 助动词及情态动词
牛津实用英语语法:130 could或 was/were allowed to
牛津实用英语语法 58 what的用法
牛津实用英语语法:97 动词和介词
牛津实用英语语法:88 间接宾语前to和for的省略
牛津实用英语语法:134 could替代may/might的用法
牛津实用英语语法:82 连接性关系从句
牛津实用英语语法 83 what(关系代词)和which(连接关系词)
牛津实用英语语法:136 can和 be able的各种形式
牛津实用英语语法:131 请求许可
牛津实用英语语法:110 附加疑问
牛津实用英语语法:117 it is和there is的比较
牛津实用英语语法:100 动词的分类
牛津实用英语语法:95 above, over, under, below, beneath等
牛津实用英语语法:124 形式
牛津实用英语语法:104 表示疑问和请求的疑问式
牛津实用英语语法:122 have意指possess(拥有)
牛津实用英语语法:137 can/am able,could/was able
牛津实用英语语法:107 助动词:形式与句型
牛津实用英语语法:96 介词与形容词、分词连用
牛津实用英语语法:92 表示时间的介词to,till/until
牛津实用英语语法:135 can表示可能
牛津实用英语语法 70反身代词
牛津实用英语语法:113 构成各种时态的形式及用法
牛津实用英语语法:116 there is/are/was/were等
| 不限 |
| 英语教案 |
| 英语课件 |
| 英语试题 |
| 不限 |
| 不限 |
| 上册 |
| 下册 |
| 不限 |