Bug 58833
Summary: | ext3 BUG: transaction.c:708 | ||
---|---|---|---|
Product: | [Retired] Red Hat Linux | Reporter: | Michael K. Johnson <johnsonm> |
Component: | kernel | Assignee: | Stephen Tweedie <sct> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Brock Organ <borgan> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 7.2 | CC: | shishz |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i386 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2004-01-05 20:28:43 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Attachments: |
Description
Michael K. Johnson
2002-01-25 15:20:52 UTC
Created attachment 43494 [details]
The ksyms file corresponding to the original BUG() message
Created attachment 43495 [details]
Oops from after reboot, trying to access the same filesystem (/home)
Created attachment 43496 [details]
ksyms corresponding to the oops after reboot
Finally, after oops on reboot, I did touch /forcefsck and rebooted again, and got one error message regarding /home: fsck: /boot: 46/13832 files (4.3% non-contiguous), 9777/55296 blocks fsck: /home: recovering journal fsck: Inode 32775, i_blocks is 2147778496, should be 83381808. FIXED. fsck: /home: 17658/5881856 files (0.9% non-contiguous), 11165842/11753520 blocks 32775, at least now, is the .xsession-errors file from the user who was logged in at the time of the original BUG(). I believe that has not changed. The file has now been truncated by restarting X, so I do not have the broken contents. For all I know, this is an instance of hardware failure, but I wanted to record it in case it provides useful data later relative to some other bug report. I am therefore putting it in NEEDINFO state. I've got the same soft of problem. I was testing a system with lucifer writing in the root filesystem. At the moment of the problem it is quite possible that the root-filesystem ( on which lucifer was running) was getting full. I had to fsck to get out of the problems ( forgot to save the output ). We will rerun lucifer on another filisystem ( /var ) that is nog getting full I was just adding this to give more info. Jun 16 17:25:34 dwalin3 kernel: Assertion failure in do_get_write_access() at transaction.c:590: "handle->h_buffer_credits > 0" Jun 16 17:25:34 dwalin3 kernel: ------------[ cut here ]------------ Jun 16 17:25:34 dwalin3 kernel: kernel BUG at transaction.c:590! Jun 16 17:25:34 dwalin3 kernel: invalid operand: 0000 Jun 16 17:25:34 dwalin3 kernel: CPU: 0 Jun 16 17:25:34 dwalin3 kernel: EIP: 0010: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9-13/kernel/drivers/net/3c+- 1402422/96] Not tainted Jun 16 17:25:34 dwalin3 kernel: EIP: 0010:[<c88009ca>] Not tainted Jun 16 17:25:34 dwalin3 kernel: EFLAGS: 00010296 Jun 16 17:25:34 dwalin3 kernel: eax: 00000021 ebx: 00000000 ecx: 00000001 edx: 00001d61 Jun 16 17:25:34 dwalin3 kernel: esi: c6e261e0 edi: c1548ce0 ebp: c06a12e0 esp: c7917d9c Jun 16 17:25:34 dwalin3 kernel: ds: 0018 es: 0018 ss: 0018 Jun 16 17:25:34 dwalin3 kernel: Process lucifer (pid: 8784, stackpage=c7917000) Jun 16 17:25:34 dwalin3 kernel: Stack: c88091b0 0000024e 00000000 00000000 0006d953 c1483000 c6e26ae0 c1483094 Jun 16 17:25:34 dwalin3 kernel: c1483000 c36e9320 c06a12e0 c8800eb6 c36e9320 c06a12e0 00000000 c1548ce0 Jun 16 17:25:34 dwalin3 kernel: c147e400 c36e9320 c5c79c80 c8813468 c36e9320 c1548ce0 00000000 c8800284 Jun 16 17:25:34 dwalin3 kernel: Call Trace: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9-13/kernel/drivers/net/3c+-1367632/96] __insmod_jbd_S.rodata_L96 [jbd] 0x25c0 Jun 16 17:25:34 dwalin3 kernel: Call Trace: [<c88091b0>] __insmod_jbd_S.rodata_L96 [jbd] 0x25c0 Jun 16 17:25:34 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1401162/96] journal_get_write_access_R74245737 [jbd] 0x36 Jun 16 17:25:34 dwalin3 kernel: [<c8800eb6>] journal_get_write_access_R74245737 [jbd] 0x36 Jun 16 17:25:34 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1325976/96] __insmod_ext3_S.text_L42880 [ext3] 0x7408 Jun 16 17:25:34 dwalin3 kernel: [<c8813468>] __insmod_ext3_S.text_L42880 [ext3] 0x7408 Jun 16 17:25:34 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1404284/96] __insmod_jbd_S.text_L26736 [jbd] 0x224 Jun 16 17:25:34 dwalin3 kernel: [<c8800284>] __insmod_jbd_S.text_L26736 [jbd] 0x224 Jun 16 17:25:34 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1404033/96] journal_start_R168f288d [jbd] 0xbf Jun 16 17:25:34 dwalin3 kernel: [<c880037f>] journal_start_R168f288d [jbd]0xbf Jun 16 17:25:35 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1335338/96] __insmod_ext3_S.text_L42880 [ext3] 0x4f76 Jun 16 17:25:35 dwalin3 kernel: [<c8810fd6>] __insmod_ext3_S.text_L42880 [ext3] 0x4f76 Jun 16 17:25:35 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1335314/96] __insmod_ext3_S.text_L42880 [ext3] 0x4f8e Jun 16 17:25:35 dwalin3 kernel: [<c8810fee>] __insmod_ext3_S.text_L42880 [ext3] 0x4f8e Jun 16 17:25:35 dwalin3 kernel: [truncate_list_pages+477/496] truncate_list_pages [kernel] 0x1dd Jun 16 17:25:35 dwalin3 kernel: [<c012797d>] truncate_list_pages [kernel] 0x1dd Jun 16 17:25:35 dwalin3 kernel: [truncate_inode_pages+143/160] truncate_inode_pages [kernel] 0x8f Jun 16 17:25:35 dwalin3 kernel: [<c0127a1f>] truncate_inode_pages [kernel] 0x8f Jun 16 17:25:35 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1335536/96] __insmod_ext3_S.text_L42880 [ext3] 0x4eb0 Jun 16 17:25:35 dwalin3 kernel: [<c8810f10>] __insmod_ext3_S.text_L42880 [ext3] 0x4eb0 Jun 16 17:25:35 dwalin3 kernel: [vmtruncate+361/384] vmtruncate [kernel] 0x169 Jun 16 17:25:35 dwalin3 kernel: [<c01258a9>] vmtruncate [kernel] 0x169 Jun 16 17:25:35 dwalin3 kernel: [generic_file_write+1522/1552] generic_file_write [kernel] 0x5f2 Jun 16 17:25:35 dwalin3 kernel: [<c012ac12>] generic_file_write [kernel] 0x5f2 Jun 16 17:25:35 dwalin3 kernel: [3c59x:__insmod_3c59x_O/lib/modules/2.4.9- 13/kernel/drivers/net/3c+-1349118/96] __insmod_ext3_S.text_L42880 [ext3] 0x19a2 Jun 16 17:25:35 dwalin3 kernel: [<c880da02>] __insmod_ext3_S.text_L42880 [ext3] 0x19a2 Jun 16 17:25:35 dwalin3 kernel: [sys_write+150/208] sys_write [kernel] 0x96 Jun 16 17:25:35 dwalin3 kernel: [<c01365c6>] sys_write [kernel] 0x96 Jun 16 17:25:35 dwalin3 kernel: [system_call+51/56] system_call [kernel] 0x33 Jun 16 17:25:35 dwalin3 kernel: [<c0106f3b>] system_call [kernel] 0x33 Jun 16 17:25:35 dwalin3 kernel: Jun 16 17:25:35 dwalin3 kernel: Jun 16 17:25:35 dwalin3 kernel: Code: 0f 0b 59 5b 8b 54 24 28 8b 42 04 8b 4c 24 28 48 89 41 04 6a I think that what happened here is related to the disk full situation. Older versions of ext3 could get their block accounting confused if the disk became full. The ext3 truncate code relies on that block accounting to know how much journal space it needs to reserve to successfully truncate a file. Having said that, ext3 should really recover from a case where i_blocks is wrong, but the current kernels should not have that problem when running out of disk space. |