28th Weekly Operation Report on DIRAC Distributed Computing
YAN Tian
From 2015-07-08 to 2015-07-15
Weekly Running Jobs by User
item: value
active users: 1
max running jobs: 328
average running jobs: 319
total executed jobs: 11.8 k
Notes: CEPC production user weiyq kept running jobs. No BES users ran jobs this week.
Final Status of Running Jobs
Note: StoRM DB problem, fixed.
failed reason: percent
upload/download failed: 15.5%
stalled: 0.39%
application error: 0.93%
other
Output Data Generated and Transferred
Quality: good except during the IHEP-STORM downtime; WHU-USER acted as the failover SE.
Total: 3.38 TB (~0.483 TB/day)
Running Jobs by Site
3 sites in production: OpenStack, OpenNebula, WHU
Job Final Status at Each Site (inputSandbox errors and pending requests ignored)
OpenStack: 2165 jobs, 99.0% done
WHU: 4990 jobs, 98.9% done
OpenNebula: 2862 jobs, 97.1% done
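The per-site figures on this slide can be combined into one job-weighted done rate. The snippet below is a minimal sketch of that arithmetic, not part of the DIRAC tooling; the `sites` dictionary and the helper name `overall_done_rate` are illustrative assumptions, and the numbers are copied from the slide.

```python
# Per-site figures copied from the slide: (number of jobs, fraction done).
# The dictionary layout and function name are illustrative, not a DIRAC API.
sites = {
    "OpenStack": (2165, 0.990),
    "WHU": (4990, 0.989),
    "OpenNebula": (2862, 0.971),
}


def overall_done_rate(site_stats):
    """Return (total jobs, job-weighted done fraction) across all sites."""
    total_jobs = sum(jobs for jobs, _ in site_stats.values())
    done_jobs = sum(jobs * rate for jobs, rate in site_stats.values())
    return total_jobs, done_jobs / total_jobs


total_jobs, rate = overall_done_rate(sites)
print(f"{total_jobs} jobs, {rate:.1%} done overall")  # 10017 jobs, 98.4% done overall
```

Weighting by job count matters here because WHU ran roughly as many jobs as the other two sites combined, so a plain average of the three percentages would understate its influence.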
Failed Types at Site
Description: All sites are good this week.
Cumulative User Jobs
Total user jobs: 11.8 k
weiyq: 100%
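Operations Log for This Week
07-08: Marco ("小 Marco") replied that the GRID.INFN site has been redeployed and is being debugged. It has been switched to elastic VMs.
07-09: After a configuration change, the StoRM SE was restarted in the wrong order, which left some incorrect status information in its database. As a result the data transfer success rate dropped to 50%; after the database was cleaned up the problem went away. CEPC jobs were affected.
07-12: The maximum number of threads for the StoRM SE frontend was changed to 200; it is running normally.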