Steve Kilduff
2014-Jun-03 15:21 UTC
[Puppet Users] "puppet agent: applying configuration" success differs to manual command line "puppet agent -t"
Hi guys, I've searched but not found what I'm looking for, sorry if this has been asked before. Background: I am trying to monitor puppet run success by monitoring the file /var/lib/puppet/state/last_run_summary.yaml. Then I am trying to break a puppet run, by temporarily removing a manifest on the puppet master, which is needed by a client. This is my test to see if the check works and gets caught by our monitoring system. A puppet agent -t looks like: {code} puppet agent -t Info: Retrieving plugin Info: Loading facts in /var/lib/puppet/lib/facter/filesystems.rb Info: Loading facts in /var/lib/puppet/lib/facter/postgres_default_version.rb Info: Loading facts in /var/lib/puppet/lib/facter/root_home.rb Info: Loading facts in /var/lib/puppet/lib/facter/rabbitmq_erlang_cookie.rb Info: Loading facts in /var/lib/puppet/lib/facter/ip6tables_version.rb Info: Loading facts in /var/lib/puppet/lib/facter/concat_basedir.rb Info: Loading facts in /var/lib/puppet/lib/facter/iptables_persistent_version.rb Info: Loading facts in /var/lib/puppet/lib/facter/puppet_vardir.rb Info: Loading facts in /var/lib/puppet/lib/facter/iptables_version.rb Info: Loading facts in /var/lib/puppet/lib/facter/os_maj_version.rb Info: Loading facts in /var/lib/puppet/lib/facter/facter_dot_d.rb Info: Loading facts in /var/lib/puppet/lib/facter/pe_version.rb Error: Could not retrieve catalog from remote server: Error 400 on SERVER: Could not find class role::ouf for ov28.fqdn on node ov28.fqdn Warning: Not using cache on failed catalog Error: Could not retrieve catalog; skipping run {code} And then, I run my monitor to see if it detects a broken run: sudo -u xymon sudo /usr/libexec/xymon/client/ext/check_puppet.rb -w 2000 -c 3600 CRITICAL: FAILED - Puppet failed to run. Missing dependencies? Catalog compilation failed? Last run 23 seconds ago|time_since_last_run=23s;2000;3600;0 failed_resources=99;;;0 failed_events=99;;;0 Great, the check detects that puppet has failed. The last_run_summary looks like this after the run: cat /var/lib/puppet/state/last_run_summary.yaml --- version: config: puppet: "3.4.3" time: last_run: 1401807503 However. After puppet agent schedules a puppet run, I do not get the same errors. The contents of last_run_summary.yaml look like a normal puppet run has completed successfully: cat /var/lib/puppet/state/last_run_summary.yaml --- changes: total: 0 version: puppet: "3.4.3" config: 1401798243 time: last_run: 1401808053 anchor: 0.002382 total: 227.941278069473 exec: 0.552989 datacat_fragment: 0.00575 mount: 0.001974 ssh_authorized_key: 0.025437 schedule: 0.000933 package: 0.542415 datacat_collector: 0.012692 user: 0.130179 host: 0.000364 filebucket: 0.000187 file: 220.198688 config_retrieval: 1.89250206947327 service: 4.57266 group: 0.002126 resources: changed: 0 failed_to_restart: 0 total: 513 out_of_sync: 0 skipped: 0 restarted: 0 failed: 0 scheduled: 0 events: failure: 0 total: 0 success: 0 And so the monitor does not pick up the errors. Any ideas? What am I doing wrong? Thanks in advance :) -- You received this message because you are subscribed to the Google Groups "Puppet Users" group. To unsubscribe from this group and stop receiving emails from it, send an email to puppet-users+unsubscribe@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/puppet-users/0c316d9a-c636-4e28-a1d7-af20faa82558%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.