It would be useful to know what zuul executor ran a job so that we can
track potential executor specific issues back to that executor via
logstash. Add the zuul executors hostname to the log processing payload
which will result in us storing it in elasticsearch.
Change-Id: Ifbd7c7f91a006bd63e3df6053297ccc36bb3240b
Be more explicit in the logstash job submissions role library code about
why we can get away with treating foo.txt and foo.txt.gz as the same
content.
Change-Id: I44a7310f7aa149919cc9aa38510329ce879c8f17
We compress log files early so that we minimize network traffic and disk
storage requirements. But logically foo.txt and foo.txt.gz are identical
for our purposes when hosting log files. You can ask apache for either
and should get the same content.
This creates problems when indexing logs because we had been treating
the non .gz suffix'ed files as the logical names but now we don't have
them in all cases in ES because we submit jobs for the .gz file
resulting in the name in ES having the .gz suffix. Additionally if we
have both foo.txt and foo.txt.gz on disk when we generate the log
indexing jobs we'll submit two jobs to index the same log file contents.
We can address both these problems if we early on treat the non .gz as
the "logical" name and only submit jobs for the no suffix names.
Change-Id: Id99d6bc58f94c197b567ec4080ab24715128d855
fix a "bug" in that the flake8 configuration in tox.ini was exclusively
selecting H231 as the only error it would report, so it was missing the
errors in the python modules (such as submit_log_processor_jobs). Due to
this being the case for a long time (since 2004) limit the more thorough
linting to the roles/ and playbooks/ directories where we'll be adding
ansible plugins/modules/etc. Also, lint in jenkins/script and nodepool.
Fix problems found.
We can lint everything with pep8 once the zuul v2 scripts are removed,
not worth patching them right now.
Change-Id: I479f010643cf3b67c183d763510f07a33400d38b
Co-Authored-By: Jesse Keating <omgjlk@us.ibm.com>
The log server may have just been added to the inventory; we're
really interested in the node the job ran on, so make sure that
the node we grab is a nodepool node.
Change-Id: I3f228f9a0c913476f946f546cdd5a2d56864c549
For easier debugging. Also, override the invocation kwarg so that
ansible doesn't automatically log the arguments to this module
(since they include all of the hostvars which include way too much
data).
Change-Id: I415da15dd1dc154e08e159453235448c90a5e5d2
These roles replace the jenkins-log-client portion of the
logstash/subunit queue processing system. Much of the code is
derived from the jenkins-log-client.
Rather than subscribing to ZMQ events emitted from jenkins (and
later zuul v2.5), these roles are intended to be run in a late base
post playbook (right after log uploading). They examine the logs
directory which contains the files that were just uploaded and any
matching files have jobs submitted to the gearman queue. Currently
the module attempts to maintain compatability as much as possible,
but eventually we should represent some of the data in a more v3-like
manner, as well as consider adding additional data. Some suggestions
are noted in comments.
Change-Id: I10ea613d9278465b90d891371d2626b9e99f8f31