Hi!

We're using 7.0.1GA Network edition and admin console shows the service status of a MTA/proxy host as down. As per wiki, I'm posting outputs of the commands and hopefully, someone can help me understand where the issue lies.

There is a separate MTA+nginx proxy machine and MSS machine.

On the MSS:
zmloggerctl status; echo $?; ps ax | grep zmlogger
4307 ? S 0:00 /usr/bin/perl /opt/zimbra/libexec/zmlogger
4368 ? S 0:00 zmlogger: zmrrdfetch: server
14507 pts/1 R+ 0:00 grep zmlogger

/opt/zimbra/libexec/zmrrdfetch -f zmmtastats
[...]
1301327370,0,0,,0,0,0,0,0,0
1301327400,0,0,,0,0,0,0,0,0
1301327430,,,,,,,,,
1301327460,,,,,,,,,

/opt/zimbra/libexec/zmrrdfetch -c -f zmmtastats
[..]
1301326800,,,,,,,,,
1301327100,0,0,,0,0,0,0,0,0
1301327400,0,0,,0,0,0,0,0,0
1301327700,,,,,,,,,

grep -w MTA /var/log/zimbra-stats.log
Mar 28 17:50:11 zmss-stag zimbramon[14137]: 14137:info: MTA: stag.zcs.si: filter_misc,clam_events,mta_delay,mta_volume,filte r_virus,filter_count,mta_count,filter_spam,sendmai l_events:: 0,0,0,0,0,0,0,0,0

tail /var/log/zimbra-stats.log
[...]
Mar 28 17:52:06 zmss-stag zimbramon[14624]: 14624:info: 2011-03-28 17:52:01, STATUS: zmss-stag.xenya.si: stats: Running
Mar 28 17:52:06 zmss-stag zimbramon[14624]: 14624:info: 2011-03-28 17:52:01, STATUS: zmss-stag.xenya.si: zmconfigd: Running

zmsoap -z GetLoggerStatsRequest stats/@name=zmmtastats | head -20
<GetLoggerStatsResponse xmlns="urn:zimbraAdmin">
<hostname hn="zmss-stag.xenya.si">
<stats name="zmmtastats"/>
</hostname>
<hostname hn="stag.zcs.si">
<stats name="zmmtastats">
<values t="1301241180">
<stat name="filter_misc" value="0"/>
<stat name="clam_events" value="0"/>
<stat name="mta_delay" value=""/>
<stat name="mta_volume" value="0"/>
<stat name="filter_virus" value="0"/>
<stat name="filter_count" value="0"/>
<stat name="mta_count" value="0"/>
<stat name="filter_spam" value="0"/>
<stat name="sendmail_events" value="0"/>
[...]

zmsoap -z GetLoggerStatsRequest stats/@name=zmmtastats @limit=1 | head -50
<GetLoggerStatsResponse xmlns="urn:zimbraAdmin">
<hostname hn="zmss-stag.xenya.si">
<stats name="zmmtastats"/>
</hostname>
<hostname hn="stag.zcs.si">
<stats name="zmmtastats">
<values t="1301241300">
<stat name="filter_misc" value="0"/>
<stat name="clam_events" value="0"/>
<stat name="mta_delay" value=""/>
<stat name="mta_volume" value="0"/>
<stat name="filter_virus" value="0"/>
[..]

And on the proxy:
zmloggerctl status; echo $?; ps ax | grep zmlogger
zmlogswatchctl is not running <-- is this the problem??
1
12311 ? S 7:24 /usr/bin/perl /opt/zimbra/libexec/zmlogger
12401 ? S 0:00 zmlogger: zmrrdfetch: server
22721 pts/0 R+ 0:00 grep zmlogger

/opt/zimbra/libexec/zmrrdfetch -f zmmtastats
no structure found for zmmtastats at /opt/zimbra/libexec/zmrrdfetch line 239.

/opt/zimbra/libexec/zmrrdfetch -c -f zmmtastats
no structure found for zmmtastats at /opt/zimbra/libexec/zmrrdfetch line 239.

grep -w MTA /var/log/zimbra-stats.log
produces no results

tail /var/log/zimbra-stats.log
Mar 28 18:03:30 zmta-staging zimbramon[10740]: 10740:info: :::EC97D480-5954-11E0-B519-D8FF838DCE4F:::1:await, dm-1:svctm, dm-1:%util, dm-2:rrqm/s, dm-2:wrqm/s, dm-2:r/s, dm-2:w/s, dm-2:rkB/s, dm-2:wkB/s, dm-2:avgrq-sz, dm-2:avgqu-sz, dm-2:await, dm-2:svctm, dm-2:%util, dm-3:rrqm/s, dm-3:wrqm/s, dm-3:r/s, dm-3:w/s, dm-3:rkB/s, dm-3:wkB/s, dm-3:avgrq-sz, dm-3:avgqu-sz, dm-3:await, dm-3:svctm, dm-3:%util, dm-4:rrqm/s, dm-4:wrqm/s, dm-4:r/s, dm-4:w/s, dm-4:rkB/s, dm-4:wkB/s, dm-4:avgrq-sz, dm-4:avgqu-sz, dm-4:await, dm-4:svctm, dm-4:%util:: 03/28/2011 18:03:30, 0.00, 7.36, 0.10, 3.00, 0.30, 34.56, 22.49, 0.01, 3.06, 0.54, 0.17, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 7.36, 0.10, 3.00, 0.30, 34.56, 22.49, 0.01, 3.06, 0.54, 0.17, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.00, 0.03, 2.30, 0.03, 2.30, 2.00, 0.00, 0.47, 0.20, 0.05, 0.0:::EC97E254-5954-11E0-B519-D8FF838DCE4F:::

zmsoap -z GetLoggerStatsRequest stats/@name=zmmtastats | head -20
[] INFO: I/O exception (java.net.ConnectException) caught when processing request: Connection refused
[] INFO: Retrying request
java.net.ConnectException: Connection refused

Regards,
Miha.

ps: I tried starting zmlogswatchctl on the proxy:
zmlogswatchctl start
Starting logswatch...done.

but it obviously does not start, even though the PID file is present (though no PID with that number exists). There is a PID reference in zmswatch.out and this PID has a corresponding processsg 10541
zimbra 10541 0.0 0.9 33940 6956 pts/0 S 17:03 0:00 /usr/bin/perl /opt/zimbra/libexec/swatch --config-file=/opt/zimbra/conf/swatchrc --use-cpan-file-tail --script-dir=/opt/zimbra/data/tmp -t /var/log/zimbra.log
zimbra 10584 0.0 1.5 42856 11744 pts/0 S 17:03 0:00 /usr/bin/perl /opt/zimbra/data/tmp/.swatch_script.10541
zimbra 23104 0.0 0.0 6020 548 pts/0 R+ 18:01 0:00 grep 10541