On hydra.nixos.org the queue runner had child processes that were
stuck handling an exception:
Thread 1 (Thread 0x7f501f7fe640 (LWP 1413473) "bld~v54h5zkhmb3"):
#0 futex_wait (private=0, expected=2, futex_word=0x7f50c27969b0 <_rtld_local+2480>) at ../sysdeps/nptl/futex-internal.h:146
#1 __lll_lock_wait (futex=0x7f50c27969b0 <_rtld_local+2480>, private=0) at lowlevellock.c:52
#2 0x00007f50c21eaee4 in __GI___pthread_mutex_lock (mutex=0x7f50c27969b0 <_rtld_local+2480>) at ../nptl/pthread_mutex_lock.c:115
#3 0x00007f50c1854bef in __GI___dl_iterate_phdr (callback=0x7f50c190c020 <_Unwind_IteratePhdrCallback>, data=0x7f501f7fb040) at dl-iteratephdr.c:40
#4 0x00007f50c190d2d1 in _Unwind_Find_FDE () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#5 0x00007f50c19099b3 in uw_frame_state_for () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#6 0x00007f50c190ab90 in uw_init_context_1 () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#7 0x00007f50c190b08e in _Unwind_RaiseException () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#8 0x00007f50c1b02ab7 in __cxa_throw () from /nix/store/dd8swlwhpdhn6bv219562vyxhi8278hs-gcc-10.3.0-lib/lib/libstdc++.so.6
#9 0x00007f50c1d01abe in nix::parseURL (url="root@cb893012.packethost.net") at src/libutil/url.cc:53
#10 0x0000000000484f55 in extraStoreArgs (machine="root@cb893012.packethost.net") at build-remote.cc:35
#11 operator() (__closure=0x7f4fe9fe0420) at build-remote.cc:79
...
Maybe the fork happened while another thread was holding a global
stack-unwinding lock
(https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71744). In any case, since
the hanging child inherits all file descriptors to the SSH clients,
shutting down remote builds (via 'child.to = -1' in
State::buildRemote()) doesn't work, and 'child.pid.wait()' hangs
forever.
So let's not do any significant work between fork and exec.
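A minimal sketch of the pattern the fix follows (made-up names, not the
actual build-remote.cc code): do everything that may allocate, throw or
take locks before fork(), and call only async-signal-safe functions in
the child until exec():

    #include <sys/types.h>
    #include <unistd.h>
    #include <string>
    #include <vector>

    // Spawn an SSH client for a remote build. URL parsing, string
    // building etc. happen in the parent, where throwing is safe.
    pid_t spawnSsh(const std::string & host, int toFd, int fromFd)
    {
        std::vector<std::string> args = {"ssh", host, "--", "nix-store", "--serve"};
        std::vector<char *> argv;
        for (auto & a : args) argv.push_back(a.data());
        argv.push_back(nullptr);

        pid_t pid = fork();
        if (pid == 0) {
            // Child: only async-signal-safe calls from here to exec.
            dup2(toFd, STDIN_FILENO);
            dup2(fromFd, STDOUT_FILENO);
            execvp(argv[0], argv.data());
            _exit(1); // exec failed; don't unwind or run atexit handlers
        }
        return pid;
    }

In particular, nix::parseURL() (frame #9 above) must run before the
fork(), since throwing an exception in the child can deadlock on the
unwinder lock inherited from the parent.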
Re-executing this search_related query on every access turned out to
cause serious performance problems. If a jobset had a lot of error
output stored and there were many hundreds or thousands of active jobs,
this could easily generate >1 Gbps of network traffic.
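The general shape of the fix, as an invented C++ sketch (the real code
is Perl/DBIx::Class, so the names here are hypothetical): fetch the
expensive value once per object and reuse it, rather than re-running the
query on every access:

    #include <optional>
    #include <string>

    struct JobsetRow
    {
        std::optional<std::string> cachedErrorMsg;

        std::string fetchErrorMsgFromDb()
        {
            // Stands in for the expensive search_related round trip.
            return "...";
        }

        const std::string & errorMsg()
        {
            if (!cachedErrorMsg)                        // first access: one query
                cachedErrorMsg = fetchErrorMsgFromDb();
            return *cachedErrorMsg;                     // later accesses: no traffic
        }
    };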
Otherwise, when the port is chosen randomly (e.g. because no port, or
port 0, was specified), the log would just show port 0 rather than the
port that is actually serving the metrics.
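A sketch of the standard way to get the real port (assumed, not the
actual code): after bind()ing with port 0, ask the kernel which port it
picked via getsockname() and log that instead of the configured value:

    #include <arpa/inet.h>
    #include <netinet/in.h>
    #include <sys/socket.h>

    // Returns the port the socket is actually bound to, or -1 on error.
    int actualPort(int sock)
    {
        sockaddr_in addr {};
        socklen_t len = sizeof(addr);
        if (getsockname(sock, reinterpret_cast<sockaddr *>(&addr), &len) != 0)
            return -1;
        return ntohs(addr.sin_port);
    }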
This is syntactically lighter weight, and demonstrates that there are no
weird dynamic lifetimes involved: we just pass a regular reference to
the callee, which only borrows it for the duration of the call.
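For illustration, the kind of signature change this describes (invented
example, not the actual diff):

    #include <memory>

    struct Store { /* ... */ };

    // Before (hypothetical): a smart-pointer parameter hints that the
    // callee might share ownership or extend the store's lifetime:
    //     void copyPaths(std::shared_ptr<Store> store);

    // After: a plain reference makes the contract obvious; the callee
    // only borrows `store` for the duration of the call.
    void copyPaths(Store & store)
    {
        /* use store; nothing is retained after returning */
    }

    int main()
    {
        auto store = std::make_shared<Store>();
        copyPaths(*store); // ownership stays with the caller
    }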