QQ登录

只需要一步,快速开始

 注册地址  找回密码
查看: 2816|回复: 2
打印 上一主题 下一主题

每日科技报告 第76期 New Automated Tool 'Debugs' Nuclear Weapon Simulations

[复制链接]
字体大小: 正常 放大

522

主题

10

听众

4072

积分

升级  69.07%

  • TA的每日心情
    奋斗
    2015-1-3 17:18
  • 签到天数: 6 天

    [LV.2]偶尔看看I

    自我介绍
    学习中!

    优秀斑竹奖 元老勋章 新人进步奖 最具活力勋章

    群组Matlab讨论组

    群组C 语言讨论组

    群组每天多学一点点

    群组数学趣味、游戏、IQ等

    群组南京邮电大学数模协会

    跳转到指定楼层
    1#
    发表于 2010-6-3 20:42 |只看该作者 |倒序浏览
    |招呼Ta 关注Ta
    本帖最后由 sea_star666 于 2010-6-3 20:48 编辑

    Purdue University researchers, working with high-performance computing experts at Lawrence Liver more National Laboratory, have created anautomated program to "debug" simulations used to more efficiently certify the nation's nuclear weapons.

    The program, called AutomaDeD (pronounced like automated), finds errors in computer code for complex "parallel" programs.
    "The simulations take several weeks to run, and then they have to be debugged to correct errors in the code," said Saurabh Bagchi, an associate professor in Purdue's School of Electrical and Computer Engineering. "The error might have occurred in the first hour of operation, and if you had known about it you could have stopped it then."
    Because international treaties forbid the detonation of nuclear test weapons, certification is done using complex simulations. The simulations, which may contain as many as 100,000 lines of computer code, must accurately show reactions taking place on the scale of milliseconds, or thousandths of a second.
    "Many times an error in a simulation code may not become evident until long after it occurs," said Bronis R. de Supinski, co-leader of the ASC Application Development Environment Performance Team at the U.S. Department of Energy's Lawrence Liver more National Laboratory."These delays are challenging since they make the actual location of the bug unclear."
    In parallel operations used for powerful simulation tools, a highly complex job is split into numerous smaller and more manageable processes that are handled by separate machines in large computer clusters. After the computers complete their individual processes, all of the parallel results are combined.
    Conventional debugging programs, however, must be operated manually,with engineers navigating through a large number of processes.
    "Debuggers have worked well for sequential applications," Bagchi said. "But when we extend these to large parallel applications,application developers are not very happy because it's very time consuming and difficult to do the manual debugging. It is just difficult for human cognitive abilities to keep track of what is going on simultaneously in many processes and determine what is anomalous."
    So, to enable the automatic debugging of the simulations, there searchers created AutomaDeD, which stands for automata-based debugging for dissimilar parallel tasks.
    "The idea is to use AutomaDeD as the simulation is running to automatically monitor what's happening," Bagchi said. "If things start going wrong, AutomaDeD would stop and flag which process and which part of the code in the process is likely anomalous."
    Errors in software code cause "stalls" and "hangs" that slow or halt simulations or give incorrect results. Another problem with parallel programs is interference from software that previously ran on the samecomputer clusters but were not properly expunged before the new jobstarted running.
    Recent research findings show AutomatDeD was 90 percent accurate in identifying the time "phase" when stalls and hangs occurred; 80 percent accurate in identifying the specific tasks that were the sources for stalls and hangs; and 70 percent accurate in identifying the interference faults.
    The findings will be detailed in a research pa-per to be presented on June 30 during the 40th Annual IEEE/IFIP International Conference on Dependable Systems and Networks in Chicago. The pap-er was written byPurdue doctoral student Ignacio Laguna, Bagchi, and Lawrence Liver morescientists Greg Bronevetsky, de Supinski, Dong H. Ahn and MartinSchulz. The primary developers of the program are Bronevetsky andLaguna.
    The same debugging approach could be used to find errors in other parallel applications, such as those used in climate modeling and high-energy particle physics.
    AutomaDeD works first by grouping the large number of processes in to a smaller number of "equivalence classes" with similar traits. Grouping the processes into equivalence classes keeps the analysis sim-ple enough that it can be done while the simulation is running.
    AutomataDeD also works by splitting a simulation into numerous windows of time, called phases.
    "So our tool lets you know if the error occurs for task 1 and task 5in phase 153 and allows you to zoom in and find the specific part of the code that is problematic," Bagchi said.
    Large computer clusters operated by Lawrence Liver more containing thousands of processors have been used for the debugging operations.
    Purdue researchers did not work with the actual classified nuclear weapons software code but instead used generic "NAS parallel benchmarks," a set of programs designed to help evaluate the performance of parallel supercomputers developed by the NASA Advanced Supercomputing division.
    The work began a year ago, and the new debugging program is currently being used by the federal lab to detect certain types of errors. The researchers are continuing to improve the program. The work is funded by the Department of Energy.


    zan
    转播转播0 分享淘帖0 分享分享0 收藏收藏0 支持支持1 反对反对0 微信微信
    第一次用linux登录madio,纪念一下
    杜增 实名认证       

    5

    主题

    3

    听众

    1071

    积分

  • TA的每日心情
    开心
    2012-5-9 14:48
  • 签到天数: 11 天

    [LV.3]偶尔看看II

    2012挑战赛参赛者

    新人进步奖

    群组破解难题

    群组Latex研学群

    群组快乐驿站

    群组数学建模培训课堂1

    群组南京邮电大学数模协会

    回复

    使用道具 举报

    0

    主题

    4

    听众

    1156

    积分

    升级  15.6%

  • TA的每日心情
    开心
    2012-11-30 12:23
  • 签到天数: 2 天

    [LV.1]初来乍到

    自我介绍
    我是一名信息与计算科学的学生,我喜欢数学,我学习努力,工作认真。能吃苦。自制力强,很想和人讨论数学。

    不支持自定义 Discuz! 代码

    新人进步奖

    群组数学建模

    回复

    使用道具 举报

    您需要登录后才可以回帖 登录 | 注册地址

    qq
    收缩
    • 电话咨询

    • 04714969085
    fastpost

    关于我们| 联系我们| 诚征英才| 对外合作| 产品服务| QQ

    手机版|Archiver| |繁體中文 手机客户端  

    蒙公网安备 15010502000194号

    Powered by Discuz! X2.5   © 2001-2013 数学建模网-数学中国 ( 蒙ICP备14002410号-3 蒙BBS备-0002号 )     论坛法律顾问:王兆丰

    GMT+8, 2026-4-11 01:57 , Processed in 1.036523 second(s), 64 queries .

    回顶部