Various IO, networking or curl errors causing builds to fail #1026
Reference
lix-project/lix#1026
Describe the bug
Sometimes building NixOS configurations or packages, or just directly downloading a nix store path, fails.
After restarting the build/download (sometimes even multiple times), the issue is gone.
Steps To Reproduce
This is not consistently reproducible. If you're lucky, this could work:
Expected behavior
Don't fail, or retry it under the hood and display a warning.
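Until something like that exists, the "retry and warn" behavior can be approximated from the outside. The following is only a rough sketch of the idea, not anything nix or lix provides: `retry_with_warning` and `RETRY_DELAY` are hypothetical names made up for this example, and the wrapper cannot tell a transient download error from a genuine build failure.

```shell
# Sketch: rerun a command up to N times, printing a warning after each failure.
# retry_with_warning and RETRY_DELAY are hypothetical, not part of nix or lix.
retry_with_warning() (
    max_attempts="$1"; shift
    attempt=1
    while :; do
        # Stop as soon as the wrapped command succeeds.
        "$@" && exit 0
        status=$?
        if [ "$attempt" -ge "$max_attempts" ]; then
            echo "error: '$*' failed after $attempt attempts (exit $status)" >&2
            exit "$status"
        fi
        echo "warning: '$*' failed (exit $status), retrying ($attempt/$max_attempts)" >&2
        attempt=$((attempt + 1))
        # Short pause between attempts; defaults to 1 second.
        sleep "${RETRY_DELAY:-1}"
    done
)
```

It would be used as `retry_with_warning 3 <failing command>`, where the command is whatever build or download was failing. Because it retries on any non-zero exit, it is only reasonable for failures already known to be intermittent.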
nix --version output
Additional context
This is more likely for bigger downloads and possibly with WiFi connections, but it's not uncommon with a wired connection either.
We experienced this multiple times on all our machines: 3 x86_64-linux and 2 aarch64-linux.
Screenshots
code 23 is a curl bug, truncated input is very likely to be that as well. #1009 (comment)
code 56 sounds more like a connection being torn down by the remote server for whatever reason than like a lix bug.
code 55 is something we had never seen before the curl update that caused #1009, which sounds very suspicious. we should keep an eye on that, but if anything it's another curl bug.
the signal 5 thing though, that is very interesting. can you reproduce this?
Thanks for the quick response!
I'll post here if I encounter any of the errors that could be related to lix. Unfortunately, in each case (except maybe one, which I'll confirm later) rerunning the failing command makes it pass. Once it has passed, though, the failure is harder to reproduce, since the path is already in the nix store and no new download is started. Is there a better way than hunting down the store paths and running nix-store --delete /nix/store/...?
the download problems honestly aren't that interesting. if you do want to repro them we'd suggest building the derivation you're interested in into a fresh store (ie nix build --store $some_new_directory ...) since you can then discard that entire store in one piece without disturbing the rest of your system.
if you have the coredump or stack traces of the process that failed during build env setup that would also be very helpful.
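The fresh-store suggestion above can be turned into a small loop for chasing an intermittent failure. This is a minimal sketch under a few assumptions: `repro_in_fresh_store` and `NIX_BIN` are hypothetical names introduced here, `--no-link` is added only to avoid leaving a result symlink behind, and the `chmod` step assumes the store contents are read-only and must be made writable before they can be deleted.

```shell
# Sketch: repeat a build in throwaway stores to catch intermittent download failures.
# repro_in_fresh_store and NIX_BIN are hypothetical names used for illustration.
repro_in_fresh_store() (
    attempts="$1"; shift
    failures=0
    i=1
    while [ "$i" -le "$attempts" ]; do
        # Each attempt gets its own empty store, so everything is fetched again.
        store="$(mktemp -d)" || exit 2
        if ! "${NIX_BIN:-nix}" build --no-link --store "$store" "$@"; then
            failures=$((failures + 1))
            echo "attempt $i failed" >&2
        fi
        # Assumption: store paths are read-only, so make them writable first.
        chmod -R u+w "$store" && rm -rf "$store"
        i=$((i + 1))
    done
    echo "$failures of $attempts attempts failed"
    [ "$failures" -eq 0 ]
)
```

A call would look like `repro_in_fresh_store 5 <installable>`, with the installable being whatever derivation failed to download. Since every attempt re-fetches everything, this can use a lot of bandwidth and disk for large closures.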