
Daily Science and Technology Report, Issue 46: Computer Scientists Create a Multilingual Search Engine

    Posted on 2010-3-19 21:27
    Researchers from the Validation and Business Applications Group (VAI) at the Universidad Politécnica de Madrid's School of Computing have developed a multilingual search engine that queries a content repository written in an interlingua, using questions formulated in any language. The search engine returns a precise answer in the language in which the question was formulated.

    An interlingua is a language-independent representation of content. The United Nations' Universal Networking Language (UNL) is the only living, general-purpose interlingua specified by standards, handbooks and governing organizations. UNL was created to break down language barriers on the Internet. The VAI is the UNL support group for the Spanish language.
    The multilingual search engine is a question answering system that aims to return precise answers to factual questions formulated in the user's mother tongue, e.g. "Who discovered America and in which year?" The answer to this question would be "Christopher Columbus in 1492".
    Information in UNL
    The novelty of this system is that the question can be formulated in English, French, Spanish or any other language, and the system will return an answer formulated in this same language without any translation from source to target languages, because the information base that the system searches is written in UNL.
    Supposing that the answer is implicit in the question, the system exploits the features of the UNL representation of the user's question to find the answer. The search engine works by deducing the answer from the question rather than finding the answer to the question.
    The search engine searches the text corpus written in UNL to find the answer as follows. First, it searches the text corpus for statements that could contain the answer. Second, it determines which of these statements contains the answer, and what the answer is. It then generates the answer in the same language in which the question was formulated.
    In response to the question "Why was Aubert awarded the Caméré prize?", for example, the search engine searches the repository and locates a graph. From the graph, it deduces the answer to the question, i.e. "for a new type of movable dam".
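The two search steps described above can be sketched in a few lines of Python. This is only a toy illustration, not the VAI system: the `Relation` triples, the relation labels (`gol`, `obj`, `rsn`, `agt`, `tim`), the `REPOSITORY` contents and all function names are assumptions made up for the example, and real UNL expressions are considerably richer. The final step, generating the answer in the user's language, is left out.

```python
from dataclasses import dataclass

UNKNOWN = "?"  # hypothetical marker for the slot the question asks about


@dataclass(frozen=True)
class Relation:
    """A simplified binary relation between two concepts (illustrative only)."""
    label: str  # relation type, e.g. "obj" or "rsn" (assumed labels)
    head: str
    tail: str


# Toy repository: each statement is a set of relations (a small graph).
REPOSITORY = [
    frozenset({
        Relation("gol", "award", "Aubert"),
        Relation("obj", "award", "Caméré prize"),
        Relation("rsn", "award", "new type of movable dam"),
    }),
    frozenset({
        Relation("agt", "discover", "Columbus"),
        Relation("obj", "discover", "America"),
        Relation("tim", "discover", "1492"),
    }),
]


def candidates(question, repository):
    """Step 1: keep statements that contain every known relation of the question."""
    known = {r for r in question if r.tail != UNKNOWN}
    return [s for s in repository if known <= s]


def extract(question, statement):
    """Step 2: fill the question's unknown slot from the matching relation."""
    for q in question:
        if q.tail != UNKNOWN:
            continue
        for r in statement:
            if r.label == q.label and r.head == q.head:
                return r.tail
    return None


def answer(question, repository=REPOSITORY):
    """Run both steps; return None when the repository holds no answer."""
    for statement in candidates(question, repository):
        found = extract(question, statement)
        if found is not None:
            return found
    return None


# "Why was Aubert awarded the Caméré prize?" as a question graph with one open slot.
why_question = {
    Relation("gol", "award", "Aubert"),
    Relation("obj", "award", "Caméré prize"),
    Relation("rsn", "award", UNKNOWN),
}
print(answer(why_question))  # new type of movable dam
```

In this sketch the question graph is the answer graph with one slot left open, which is one way to read the idea that the answer is deduced from the question's own representation. A question whose known relations match no statement simply yields `None`.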
    Promising results
    As the information base for the exercise, the researchers used UNESCO's biographical encyclopaedia concerning the French engineer Jean Aubert (1894-1984). This encyclopaedia has 25 articles, which have been translated into UNL and contain 101 UNL expressions and 2534 universal words.
    The results of this research (82% precise answers) are very promising. A total of 75 different questions (when, how, who) were formulated, for which the right answer was known beforehand. Other questions, for which the repository contained no answer, were also formulated to examine the system's behaviour in such cases. The results confirm the validity of this search engine for developing multilingual question answering systems.