From Bugzilla Helper:
User-Agent: Mozilla/4.76 [en] (X11; U; Linux 2.4.2-2smp i686)

Description of problem:
We are developing a large multi-tasking application (using LinuxThreads). We use ps in scripts to kill jobs that are "hung" due to our own bugs. After several runs/kills, the system gets into a state where ps no longer lists all of the selected processes; it hangs instead. Similarly, "killall -9 (our program here)" hangs instead of killing the selected application. Attempts to cleanly shut down the system also fail at the step where it tries to kill all the running jobs.

How reproducible:
Sometimes

Steps to Reproduce:
1. Run script to start our application. Debug, get stuck.
2. Run script to kill our application (killall -9 [our app]).
[repeat 1 & 2 until system begins to hang]

Actual Results:
Several iterations of these two steps work OK. After the first failure, killing the application always fails. Other utilities such as ps, top, and shutdown also stop working.

Expected Results:
Application runs after [1]. Application is dead after [2].

Additional info:
If we use the reset button after attempting to shut down the system, the file system is not saved properly [and e2fsck runs on next boot].
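Since the kill scripts above depend on ps returning, one way to keep a script from hanging along with ps is to run it under a timeout. The following is only a minimal sketch, not what our scripts actually do: the helper name run_with_timeout, the "ps ax" arguments, and the 5 second limit are all assumptions for illustration. It does not fix the underlying hang; it only lets the calling script notice it.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run cmd; return its stdout as text, or None if it did not finish in time.

    Hypothetical helper for illustration only.
    """
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.DEVNULL,
        text=True,
    )
    try:
        out, _ = proc.communicate(timeout=timeout_s)
        return out
    except subprocess.TimeoutExpired:
        # Send SIGKILL but deliberately do not wait for the child:
        # if it is stuck and never exits, waiting here would hang
        # the calling script as well.
        proc.kill()
        return None


if __name__ == "__main__":
    # Assumed invocation; substitute whatever ps options the script uses.
    listing = run_with_timeout(["ps", "ax"], 5.0)
    if listing is None:
        print("ps did not return within 5 seconds", file=sys.stderr)
    else:
        print(listing, end="")
```

Returning None on timeout lets the caller tell "ps hung" apart from "ps listed no matching processes", which are different situations for a kill script.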
We have the same problem here. With more than 200 processes, ps stops working. We worked around the problem by switching back to a patched procps-2.06 instead of 2.07.
I'm experiencing the same problem running kernel 2.4.9-31 and procps-2.0.7-11 on an AMD Athlon.
Bug 61503 is likely related to this.
*** This bug has been marked as a duplicate of 61503 ***