CI: Increase default RCU stall timeout on Linux

When CONFIG_RCU_CPU_STALL_TIMEOUT is configured an RCU stall which
exceeds the default timeout will trigger an NMI and panic the VM.
Given the heavily virtualized nature of the CI environment we want
to make sure to only trigger this due to a real deadlock and not
due to over-subscription of the systems resources.  This timeout
normally defaults to 20-30 seconds and this change increases it
to 120 seconds.

Reviewed-by: Tony Hutter <hutter2@llnl.gov>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #18624
This commit is contained in:
Brian Behlendorf
2026-06-04 17:41:11 -07:00
committed by GitHub
parent cae1421e8d
commit c4d0f3dd41
@@ -186,6 +186,13 @@ case "$OS" in
sudo mount -o noatime /dev/vdb /var/tmp
sudo chmod 1777 /var/tmp
sudo mv -f /tmp/*.txt /var/tmp
# Allow for longer RCU timeouts due to the heavily virtualized and
# potentially oversubscribed nature of the CI environment.
rcu_cpu_stall_timeout="/sys/module/rcupdate/parameters/rcu_cpu_stall_timeout"
if test -f $rcu_cpu_stall_timeout; then
echo 120 | sudo sh -c "cat > '$rcu_cpu_stall_timeout'"
fi
;;
esac