lix-releng-staging/doc/manual/rl-next
Maximilian Bosch 045ee37438 libstore/local-derivation-goal: prohibit creating setuid/setgid binaries
With Linux kernel >= 6.6 and glibc 2.39, the `fchmodat2(2)` syscall is
available, and the libseccomp sandbox does not filter it.

Using it to bypass the sandbox's restriction on setuid/setgid bits has
surprising results for some builds, such as lxc[1]:

> With kernel ≥6.6 and glibc 2.39, lxc's install phase uses fchmodat2,
> which slips through 9b88e52846/src/libstore/build/local-derivation-goal.cc (L1650-L1663).
> The fixupPhase then uses fchmodat, which fails.
> With older kernel or glibc, setting the suid bit fails in the
> install phase, which is not treated as fatal, and then the
> fixup phase does not try to set it again.

Please note that there are still ways to bypass this sandbox[2] and this is
mostly a fix for the breaking builds.

This change works by creating a syscall filter for the `fchmodat2`
syscall (number 452 on most systems). The problem is that glibc 2.39
is needed to have the correct syscall number available via
`__NR_fchmodat2` / `__SNR_fchmodat2`, but this flake is still on
nixpkgs 23.11. To have this change everywhere and not dependent on the
glibc this package is built against, I added a header
"fchmodat2-compat.hh" that sets the syscall number based on the
architecture. On most platforms it's 452 according to glibc, with a few
exceptions:

    $ rg --pcre2 'define __NR_fchmodat2 (?!452)'
    sysdeps/unix/sysv/linux/x86_64/x32/arch-syscall.h
    58:#define __NR_fchmodat2 1073742276

    sysdeps/unix/sysv/linux/mips/mips64/n32/arch-syscall.h
    67:#define __NR_fchmodat2 6452

    sysdeps/unix/sysv/linux/mips/mips64/n64/arch-syscall.h
    62:#define __NR_fchmodat2 5452

    sysdeps/unix/sysv/linux/mips/mips32/arch-syscall.h
    70:#define __NR_fchmodat2 4452

    sysdeps/unix/sysv/linux/alpha/arch-syscall.h
    59:#define __NR_fchmodat2 562
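
For illustration, a compat header along these lines could pick the
number per architecture. This is only a sketch, not the contents of
"fchmodat2-compat.hh": the macro name `COMPAT_NR_FCHMODAT2`, the
constant `kFchmodat2Nr` and the exact preprocessor checks are
assumptions, while the numbers are taken from the glibc listing above.

```cpp
#include <sys/syscall.h>

// Hypothetical names; only the syscall numbers come from glibc's tables.
#if defined(__alpha__)
#  define COMPAT_NR_FCHMODAT2 562
#elif defined(__x86_64__) && defined(__ILP32__)
// x32: 0x40000000 (the x32 syscall bit) + 452
#  define COMPAT_NR_FCHMODAT2 1073742276
#elif defined(__mips__) && defined(_MIPS_SIM) && _MIPS_SIM == _ABIN32
#  define COMPAT_NR_FCHMODAT2 6452
#elif defined(__mips__) && defined(_MIPS_SIM) && _MIPS_SIM == _ABI64
#  define COMPAT_NR_FCHMODAT2 5452
#elif defined(__mips__)
// assumed to be the o32 ABI (mips32)
#  define COMPAT_NR_FCHMODAT2 4452
#else
#  define COMPAT_NR_FCHMODAT2 452
#endif

constexpr long kFchmodat2Nr = COMPAT_NR_FCHMODAT2;

// If the installed headers already know the syscall, cross-check the
// hard-coded value against them at compile time.
#ifdef __NR_fchmodat2
static_assert(kFchmodat2Nr == __NR_fchmodat2,
              "compat number disagrees with the system headers");
#endif
```

Hard-coding the number keeps the build independent of the glibc
version, and the `static_assert` only kicks in where newer headers are
present.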

I added a small regression-test to the setuid integration-test that
attempts to set the suid bit on a file using the fchmodat2 syscall.
I confirmed that the test fails without the change in
local-derivation-goal.
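
A probe in the spirit of that test could look roughly like the sketch
below. It is not the test from this change: the helper names are made
up, and it assumes a Linux architecture where the syscall number is
452. It invokes the syscall through `syscall(2)` directly so that it
does not depend on a glibc 2.39 wrapper.

```cpp
#include <cassert>
#include <cerrno>
#include <cstdlib>
#include <fcntl.h>
#include <sys/stat.h>
#include <sys/syscall.h>
#include <unistd.h>

// Assumption: 452 is the fchmodat2 number on this architecture.
constexpr long kProbeFchmodat2Nr = 452;

// Try to set the suid bit on `path` via a raw fchmodat2 call.
// Returns 0 on success, otherwise the errno of the failed call.
int try_fchmodat2_suid(const char *path) {
    errno = 0;
    long rc = syscall(kProbeFchmodat2Nr, static_cast<long>(AT_FDCWD), path,
                      static_cast<long>(S_ISUID | 0755), 0L);
    return rc == 0 ? 0 : errno;
}

// Run the probe against a freshly created temporary file.
int probe_on_temp_file() {
    char path[] = "/tmp/fchmodat2-probe-XXXXXX";
    int fd = mkstemp(path);
    assert(fd >= 0);
    close(fd);
    int err = try_fchmodat2_suid(path);
    unlink(path);
    return err;
}
```

Outside a sandbox on kernel >= 6.6 the probe is expected to return 0;
on older kernels it fails with ENOSYS. Inside a build sandbox that
filters the syscall it should return a non-zero errno instead.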

Additionally, we require libseccomp 2.5.5 or greater now: as it turns
out, libseccomp maintains an internal syscall table and
validates each rule against it. This means that when using libseccomp
2.5.4 or older, one may pass `452` as the syscall number, but since it
doesn't exist in the internal table, libseccomp will refuse to create
a filter rule for it. This happens with nixpkgs-23.11, i.e. on
stable NixOS and when building Lix against the project's flake.

To work around that:

* a backport of libseccomp 2.5.5 on upstream nixpkgs has been
  scheduled[3].

* the package now uses libseccomp 2.5.5 on its own already. This is to
  provide a quick fix since the correct fix for 23.11 is still a staging cycle
  away.

We still need the compat header though since `SCMP_SYS(fchmodat2)`
internally transforms this into `__SNR_fchmodat2` which points to
`__NR_fchmodat2` from glibc 2.39, so it wouldn't build on glibc 2.38.
The updated syscall table from libseccomp 2.5.5 is NOT used for that
step, but used later, so we need both, our compat header and their
syscall table 🤷

Relevant PRs in CppNix:

* https://github.com/NixOS/nix/pull/10591
* https://github.com/NixOS/nix/pull/10501

[1] https://github.com/NixOS/nixpkgs/issues/300635#issuecomment-2031073804
[2] https://github.com/NixOS/nixpkgs/issues/300635#issuecomment-2030844251
[3] https://github.com/NixOS/nixpkgs/pull/306070

(cherry picked from commit ba6804518772e6afb403dd55478365d4b863c854)
Change-Id: I6921ab5a363188c6bff617750d00bb517276b7fe
2024-05-03 16:29:06 +02:00
better-errors-in-nix-repl.md Print top-level errors normally in nix repl 2024-04-09 08:34:40 -07:00
debugger-locals-for-let-expressions.md manual: fix release notes 2024-03-27 03:09:14 +00:00
debugger-on-trace.md Merge pull request #9914 from 9999years/debugger-on-trace 2024-03-09 10:17:26 -07:00
drop-vendored-toml11.md Stop vendoring toml11 2024-03-27 21:04:00 -04:00
drv-string-parse-hang.md Merge pull request #9673 from pennae/drv-parse-opts 2024-03-04 07:36:51 +01:00
dup-attr-errors.md build: replace changelog-d with local script 2024-03-27 03:09:14 +00:00
empty-search-regex.md manual: fix release notes 2024-03-27 03:09:14 +00:00
enter-debugger-more-reliably-in-let-and-calls.md Merge pull request #9917 from 9999years/enter-debugger-more-reliably 2024-03-09 03:37:35 -07:00
env-size-reduction.md Merge pull request #9658 from pennae/env-diet 2024-03-04 07:37:45 +01:00
eval-system.md Merge pull request #4093 from matthewbauer/eval-system 2024-03-04 07:21:01 +01:00
fchmodat2-sandbox.md libstore/local-derivation-goal: prohibit creating setuid/setgid binaries 2024-05-03 16:29:06 +02:00
forbid-nested-debuggers.md Merge pull request #9920 from 9999years/forbid-nested-debuggers 2024-03-31 17:28:25 +00:00
formal-order.md normalize formal order on ExprLambda::show 2024-03-18 07:56:34 -06:00
inherit-error-positions.md report inherit attr errors at the duplicate name 2024-03-18 16:12:45 +01:00
inherit-from-by-need.md evaluate inherit (from) exprs only once per directive 2024-03-10 03:18:32 -06:00
new-assertions.md build: enable libstdc++ assertions 2024-04-08 15:40:12 -07:00
nix-env-json-drv-path.md Merge pull request #9573 from hercules-ci/rl-next-md-frontmatter 2024-03-04 07:12:09 +01:00
nix-flake-check-logs-actions.md Add release notes 2024-03-07 12:29:57 -08:00
nix-flake-update-ux.md manual: fix release notes 2024-03-27 03:09:14 +00:00
nixversion-fake.md build: replace changelog-d with local script 2024-03-27 03:09:14 +00:00
no-cache-eval-errors.md always re-eval cached failures 2024-04-06 04:35:25 +00:00
print-value-in-coercion-error.md manual: fix release notes 2024-03-27 03:09:14 +00:00
print-value-in-type-error.md manual: fix release notes 2024-03-27 03:09:14 +00:00
reduce-debugger-clutter.md Merge pull request #9919 from 9999years/reduce-debugger-clutter 2024-03-04 08:52:57 +01:00
repl-doc-command.md repl: improve :doc builtin repl command to support lambdas. 2024-04-03 13:47:22 -06:00
repl-overlays.md Add repl-overlays 2024-04-08 17:11:47 -07:00
short-expr-flag.md build: replace changelog-d with local script 2024-03-27 03:09:14 +00:00
source-location-in-while-evaluating-attribute.md Merge pull request #9915 from 9999years/evaluating-attribute-position 2024-03-04 09:25:17 +01:00
source-positions-in-errors.md Merge pull request #9753 from 9999years/print-value-on-type-error 2024-03-09 00:05:41 -07:00
stack-overflow-segfaults.md Merge pull request #9617 from 9999years/stack-overflow-segfault 2024-03-04 07:35:20 +01:00
upstart-removal.md build: replace changelog-d with local script 2024-03-27 03:09:14 +00:00
with-error-reporting.md Merge pull request #9753 from 9999years/print-value-on-type-error 2024-03-09 00:05:41 -07:00