hydra

Author	SHA1	Message	Date
Eelco Dolstra	44e1efff7f	Send the right nix-serve client version We were using protocol version 6 but requesting version 4. The only reason that this worked was because of a broken version check in 'nix-store --serve'. That was fixed in `c2d7456926`, which had the side-effect of breaking hydra-queue-runner.	2022-09-08 11:51:13 +02:00
Eelco Dolstra	b16470c544	nix flake info -> nix flake metadata This gets rid of a deprecation warning.	2022-09-06 19:23:20 +02:00
Janne Heß	371402c3c1	Drop the HipChat plugin https://en.wikipedia.org/wiki/HipChat says: > Following this, HipChat and Stride customers were migrated to the > Slack group collaboration platform in a transition that was completed by > February 2019.	2022-08-20 19:16:43 +02:00
Marco Rebhan	a58e2f1a64	Use libmagic for better output MIME detection	2022-08-04 22:34:58 +02:00
Janne Heß	e2756042b8	Merge pull request #965 from helsinki-systems/css_more_content Fit more content on screen	2022-07-13 23:47:04 +02:00
Janne Heß	e05118171b	Merge pull request #1229 from helsinki-systems/fix/nix-cat-store replace nix cat-store with nix store cat	2022-07-01 13:28:27 +02:00
ajs124	bb1f04ed86	AddBuilds: fix declarative jobsets with dynamic runcommand enabled $project->{enable_dynamic_run_command} is undefined	2022-06-30 01:49:30 +02:00
ajs124	bab671124d	replace nix cat-store with nix store cat the former was deprecated in favor of the latter	2022-06-30 00:24:09 +02:00
Maximilian Bosch	5c01800fbe	flake: Update Nix to 2.9.1 NOTE: I'm well-aware that we have to be careful with this to avoid new regressions on hydra.nixos.org, so this should only be merged after extensive testing from more people. Motivation: I updated Nix in my deployment to 2.9.1 and decided to also update Hydra in one go (and compile it against the newer Nix). Given that this also updates the C++ code in `hydra-{queue-runner,eval-jobs}` this patch might become useful in the future though.	2022-06-16 14:54:57 +02:00
Maximilian Bosch	a8b590014b	Fix email notifications for jobsets w/git-inputs I started to wonder quite recently why Hydra doesn't send email notifications anymore to me. I saw the following issue in the log of `hydra-notify.service`: May 22 11:57:29 hydra 9bik0bxyxbrklhx6lqwifd6af8kj84va-hydra-notify[1887289]: fatal: unsafe repository ('/var/lib/hydra/scm/git/3e70c16c266ef70dc4198705a688acccf71e932878f178277c9ac47d133cc663' is owned by someone else) May 22 11:57:29 hydra 9bik0bxyxbrklhx6lqwifd6af8kj84va-hydra-notify[1887289]: To add an exception for this directory, call: May 22 11:57:29 hydra 9bik0bxyxbrklhx6lqwifd6af8kj84va-hydra-notify[1887289]: git config --global --add safe.directory /var/lib/hydra/scm/git/3e70c16c266ef70dc4198705a688acccf71e932878f178277c9ac47d133cc663 May 22 11:57:29 hydra 9bik0bxyxbrklhx6lqwifd6af8kj84va-hydra-notify[1886654]: error running build_finished hooks: command `git log --pretty=format:%H%x09%an%x09%ae%x09%at b0c30a7557685d25a8ab3f34fdb775e66db0bc4c..eaf28389fcebc2beca13a802f79b2cca6e9ca309 --git-dir=.git' failed with e> This is also a problem because of Git's fix for CVE-2022-24765[1], so I applied the same fix as for Nix[2], by using `--git-dir` which skips the code-path for the ownership-check[3]. [1] https://lore.kernel.org/git/xmqqv8veb5i6.fsf@gitster.g/ [2] https://github.com/NixOS/nix/pull/6440 [3] To quote `git(1)`: > Specifying the location of the ".git" directory using this option > (or GIT_DIR environment variable) turns off the repository > discovery that tries to find a directory with ".git" subdirectory	2022-05-22 14:14:14 +02:00
Ulrik Strid	3c71be5b5b	GithubPulls: Don't fail on missing `Link`	2022-05-18 08:14:00 +02:00
Kayla Firestack	2cdd7974de	fix(hydra-eval-jobs): fix typo	2022-04-29 13:06:16 -04:00
Kayla Firestack	62cdbc4138	feat(hydra-eval-jobs.cc): add check_pid_status_nonblocking to catch handler	2022-04-21 10:55:51 -04:00
Kayla Firestack	cb4fa0000f	fix(hydra-eval-jobs.cc): add function to report pid status	2022-04-21 10:55:51 -04:00
Graham Christensen	5c90edd19f	Merge pull request #1103 from DeterminateSystems/runcommand/dynamic Dynamic RunCommand	2022-04-19 10:09:47 -04:00
Graham Christensen	e1965250b5	Merge pull request #1173 from DeterminateSystems/queue-runner-exporter hydra-queue-runner metrics	2022-04-07 12:27:33 -04:00
Cole Helbling	f8dc48f171	hydra-queue-runner: fixup: remove extraneous newline	2022-04-06 17:53:11 -07:00
Graham Christensen	59ac96a99c	Track the number of steps created	2022-04-06 20:23:02 -04:00
Graham Christensen	1c12c5882f	hydra queue runner: instrument the process of loading new builds with prom	2022-04-06 20:18:29 -04:00
Graham Christensen	5de08d412e	queue metrics: refactor the metrics into a struct	2022-04-06 20:00:30 -04:00
Graham Christensen	46f52b4c4e	bring back the working version Cole made	2022-04-06 15:49:38 -04:00
Cole Helbling	5bff730f2c	WIP: I love it when they delete the assignment operator :)	2022-04-06 11:41:40 -07:00
Cole Helbling	edf3c348f2	hydra-queue-runner: make entire address configurable	2022-04-06 10:59:45 -07:00
Cole Helbling	33bc60b83c	hydra-queue-runner: move exporter back to State::run It's (arguably) better than risking pinning the thread at 100% due to the busy `while` loop.	2022-04-06 10:49:14 -07:00
Eelco Dolstra	71a036ed00	Update to Nix master Flake lock file updates: • Updated input 'nix': 'github:NixOS/nix/ec90fc4d1f42db3c5e3c74dc186487d10a28c221' (2022-04-05) → 'github:NixOS/nix/5fe4fe823c193cbb7bfa05a468de91eeab09058d' (2022-04-05) • Updated input 'nix/nixpkgs': 'github:NixOS/nixpkgs/82891b5e2c2359d7e58d08849e4c89511ab94234' (2021-09-28) → 'github:NixOS/nixpkgs/530a53dcbc9437363471167a5e4762c5fcfa34a1' (2022-02-19)	2022-04-05 17:31:30 +02:00
Cole Helbling	8c5636fe18	hydra-queue-runner: use port 9198 by default Co-authored-by: Graham Christensen <graham@grahamc.com>	2022-04-02 17:32:14 -07:00
Eelco Dolstra	bcaad1c934	openConnection(): Don't throw exceptions in forked child On hydra.nixos.org the queue runner had child processes that were stuck handling an exception: Thread 1 (Thread 0x7f501f7fe640 (LWP 1413473) "bld~v54h5zkhmb3"): #0 futex_wait (private=0, expected=2, futex_word=0x7f50c27969b0 <_rtld_local+2480>) at ../sysdeps/nptl/futex-internal.h:146 #1 __lll_lock_wait (futex=0x7f50c27969b0 <_rtld_local+2480>, private=0) at lowlevellock.c:52 #2 0x00007f50c21eaee4 in __GI___pthread_mutex_lock (mutex=0x7f50c27969b0 <_rtld_local+2480>) at ../nptl/pthread_mutex_lock.c:115 #3 0x00007f50c1854bef in __GI___dl_iterate_phdr (callback=0x7f50c190c020 <_Unwind_IteratePhdrCallback>, data=0x7f501f7fb040) at dl-iteratephdr.c:40 #4 0x00007f50c190d2d1 in _Unwind_Find_FDE () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1 #5 0x00007f50c19099b3 in uw_frame_state_for () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1 #6 0x00007f50c190ab90 in uw_init_context_1 () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1 #7 0x00007f50c190b08e in _Unwind_RaiseException () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1 #8 0x00007f50c1b02ab7 in __cxa_throw () from /nix/store/dd8swlwhpdhn6bv219562vyxhi8278hs-gcc-10.3.0-lib/lib/libstdc++.so.6 #9 0x00007f50c1d01abe in nix::parseURL (url="root@cb893012.packethost.net") at src/libutil/url.cc:53 #10 0x0000000000484f55 in extraStoreArgs (machine="root@cb893012.packethost.net") at build-remote.cc:35 #11 operator() (__closure=0x7f4fe9fe0420) at build-remote.cc:79 ... Maybe the fork happened while another thread was holding some global stack unwinding lock (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71744). Anyway, since the hanging child inherits all file descriptors to SSH clients, shutting down remote builds (via 'child.to = -1' in State::buildRemote()) doesn't work and 'child.pid.wait()' hangs forever. So let's not do any significant work between fork and exec.	2022-03-30 22:39:48 +02:00
ajs124	089da272c7	fix build against nix 2.7.0 fix build after such commits as df552ff53e68dff8ca360adbdbea214ece1d08ee and e862833ec662c1bffbe31b9a229147de391e801a	2022-03-29 15:38:24 -04:00
ajs124	c64c5f0a7e	hydra-queue-runner: rename build-result.hh to hydra-build-result.hh	2022-03-29 15:34:29 -04:00
Graham Christensen	3b048ed136	Revert "Revert "Use `copyClosure` instead of `computeFSClosure` + `copyPaths`"" This reverts commit `8e3ada2afc`.	2022-03-29 15:28:47 -04:00
Cole Helbling	4789eba92c	hydra-queue-runner: split metrics functionality into its own function	2022-03-29 10:55:28 -07:00
Cole Helbling	928b3b8268	hydra-queue-runner: fix priority of flag over config file	2022-03-29 10:42:07 -07:00
Cole Helbling	5ddb9a98ca	fixup! hydra-queue-runner: log message before and after exporter is started	2022-03-29 08:47:41 -07:00
Cole Helbling	905a7a7beb	hydra-queue-runner: read metrics port from `queue_runner_metrics_port` config	2022-03-29 08:46:43 -07:00
Cole Helbling	9cdc5aceed	hydra-queue-runner: log message before and after exporter is started This way, if something goes wrong between the two, it's easier to narrow down where the issue lies.	2022-03-29 08:41:19 -07:00
Cole Helbling	8e3ada2afc	Revert "Use `copyClosure` instead of `computeFSClosure` + `copyPaths`" This reverts commit `f14c583ce5`.	2022-03-28 09:54:02 -07:00
Eelco Dolstra	962bf36939	Merge pull request #1162 from obsidiansystems/less-ref Make `copyClosureTo` take a regular C++ ref to the store	2022-03-23 16:25:59 +01:00
Eelco Dolstra	3390415905	Merge pull request #1125 from obsidiansystems/simplify--copyClosure Use `copyClosure` instead of `computeFSClosure` + `copyPaths`	2022-03-23 12:49:22 +01:00
Cole Helbling	8503a7917b	fixup! hydra-queue-runner: make registry member of State, configurable metrics port	2022-03-22 13:38:13 -07:00
Graham Christensen	e5393c2cf8	fixup: make id non-ambiguous	2022-03-19 23:56:47 -04:00
Graham Christensen	137be3452e	Reduce the jobset cols on the remaining two queries	2022-03-19 23:56:47 -04:00
Graham Christensen	f353a7ac41	update-gc-roots: try subselecting the jobset table	2022-03-19 23:56:47 -04:00
Graham Christensen	145667cb53	hydra-update-gc-roots: allow cached refs to the build's jobset Re-executing this search_related on every access turned out to create very problematic performance. If a jobset had a lot of error output stored in the jobset, and there were many hundreds or thousands of active jobs, this could easily cause >1Gbps of network traffic.	2022-03-19 23:56:47 -04:00
Graham Christensen	a582e4c485	HydraTestContext: add \n's to various dies	2022-03-19 14:46:53 -04:00
Graham Christensen	074a2f96bf	hydra-eval-jobset: emit a useful error if constituents errored	2022-03-19 14:37:12 -04:00
Cole Helbling	c0f826b92d	hydra-queue-runner: get the listening port from the exposer itself Otherwise, when the port is randomly chosen (e.g. by specifying no port, or a port of 0), it will just show that the port is 0 and not the port that is actually serving the metrics.	2022-03-14 08:41:45 -07:00
Cole Helbling	52a29d43e6	hydra-queue-runner: make registry member of State, configurable metrics port Thanks to the updated prometheus-cpp library, specifying a port of 0 will cause it to pick a random (available) port -- ideal for tests.	2022-03-11 11:58:10 -08:00
Cole Helbling	3bf31bd6a6	hydra-queue-runner: add simple "up" exporter There are probably better ways to achieve this (and will likely need to be refactored a bit to support further metrics).	2022-03-10 12:36:58 -08:00
Graham Christensen	9316544abf	src/hydra-eval-jobs/hydra-eval-jobs.cc: .get<std::string> for drvPath Co-authored-by: Kayla Fire <firestack@users.noreply.github.com>	2022-02-21 12:41:21 -05:00
Graham Christensen	290e0653ad	hydra-eval-jobs: GC root aggregate jobs	2022-02-20 12:28:40 -05:00

2877 commits