Bug 668681

Summary: NUMA: some nodes are missing on nec-em25.rhts.eng.bos.redhat.com
Product: Red Hat Enterprise Linux 6
Reporter: Cong Wang <amwang>
Component: kernel
Assignee: Cong Wang <amwang>
Status: CLOSED NOTABUG
QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high
Docs Contact:
Priority: high
Version: 6.0
CC: bpicco, jfeeney, jthomas, jwest, kzhang, lwoodman, rkhan
Target Milestone: rc   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Doc Type: Bug Fix
Last Closed: 2011-03-30 04:23:12 UTC
Bug Depends On: 668340    
Bug Blocks:    
Attachments:
rhel6 kernel log after applying patch for Bug 668340 (flags: none)

Description Cong Wang 2011-01-11 08:52:12 UTC
Created attachment 472760
rhel6 kernel log after applying patch for Bug 668340

Description of problem:
On machine nec-em25.rhts.eng.bos.redhat.com, I got:
[root@nec-em25 ~]# numactl --hardware
available: 6 nodes (1-3,5-7)
node 1 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79
node 1 size: 16303 MB
node 1 free: 15616 MB
node 2 cpus: 16 17 18 19 20 21 22 23 80 81 82 83 84 85 86 87
node 2 size: 16384 MB
node 2 free: 15821 MB
node 3 cpus: 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103
node 3 size: 16384 MB
node 3 free: 15619 MB
node 5 cpus: 40 41 42 43 44 45 46 47 104 105 106 107 108 109 110 111
node 5 size: 16384 MB
node 5 free: 15784 MB
node 6 cpus: 48 49 50 51 52 53 54 55 112 113 114 115 116 117 118 119
node 6 size: 16384 MB
node 6 free: 15668 MB
node 7 cpus: 56 57 58 59 60 61 62 63 120 121 122 123 124 125 126 127
node 7 size: 16320 MB
node 7 free: 15755 MB
No distance information available.


Upstream has the same problem. After adding some debugging printks, I found that the memory blocks belonging to the missing nodes are not marked as enabled in the ACPI SRAT; that is, the ACPI_SRAT_MEM_ENABLED flag is 0 for them.
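
To illustrate the check described above, here is a minimal sketch that decodes an SRAT Memory Affinity Structure and tests the Enabled bit. It assumes the 40-byte layout from the ACPI specification, with ACPI_SRAT_MEM_ENABLED as bit 0 of the Flags field. The helper names and the sample entries are hypothetical and are not taken from this machine's tables or from the attached log.

```python
import struct

# Bit 0 of the Flags field in an SRAT Memory Affinity Structure ("Enabled").
ACPI_SRAT_MEM_ENABLED = 1 << 0

# Little-endian, unpadded layout of the 40-byte structure:
# type, length, proximity domain, reserved, base address, range length,
# reserved, flags, reserved.
SRAT_MEM_AFFINITY = struct.Struct("<BBIHQQIIQ")


def parse_mem_affinity(entry):
    """Decode one raw Memory Affinity Structure into a small dict."""
    etype, length, domain, _, base, size, _, flags, _ = SRAT_MEM_AFFINITY.unpack(entry)
    if etype != 1 or length != SRAT_MEM_AFFINITY.size:
        raise ValueError("not an SRAT Memory Affinity Structure")
    return {
        "domain": domain,
        "base": base,
        "size": size,
        "enabled": bool(flags & ACPI_SRAT_MEM_ENABLED),
    }


def make_entry(domain, base, size, flags):
    """Build a synthetic entry for illustration only (hypothetical helper)."""
    return SRAT_MEM_AFFINITY.pack(1, 40, domain, 0, base, size, 0, flags, 0)


# Made-up sample data: one enabled range and one range with the flag cleared.
enabled_entry = make_entry(1, 0x100000000, 16 << 30, ACPI_SRAT_MEM_ENABLED)
disabled_entry = make_entry(0, 0, 16 << 30, 0)

for raw in (enabled_entry, disabled_entry):
    info = parse_mem_affinity(raw)
    state = "enabled" if info["enabled"] else "not enabled, range would be skipped"
    print("proximity domain %d: %s" % (info["domain"], state))
```

A memory range whose Enabled bit is clear contributes no memory to its proximity domain, which would be consistent with the nodes disappearing from the numactl output above.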


Version-Release number of selected component (if applicable):
2.6.32-94.el6

How reproducible:
Always

Steps to Reproduce:
1. Install RHEL6
2. Make sure the kernel has the patch for Bug 668340; otherwise NUMA will not be recognized.
3. Boot the machine and run 'numactl --hardware'
  
Actual results:
Node 0 and node 4 are missing, and the distance information is missing too.
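
As a rough way to spot this automatically, the sketch below reads the "available:" line of 'numactl --hardware' output and reports which node IDs are absent. It assumes node IDs should run contiguously from 0, and it takes the expected count of 8 from the RHEL 5.6 output in this report. The function names are hypothetical.

```python
import re


def parse_available_nodes(numactl_output):
    """Return the set of node IDs listed on the 'available:' line."""
    match = re.search(
        r"^available:\s*\d+\s+nodes\s+\(([^)]*)\)", numactl_output, re.MULTILINE
    )
    if match is None:
        raise ValueError("no 'available:' line found")
    nodes = set()
    for part in match.group(1).split(","):
        part = part.strip()
        if "-" in part:
            # A range such as "1-3" covers both endpoints.
            low, high = part.split("-")
            nodes.update(range(int(low), int(high) + 1))
        elif part:
            nodes.add(int(part))
    return nodes


def missing_nodes(numactl_output, expected_count):
    """List node IDs in 0..expected_count-1 that numactl did not report."""
    present = parse_available_nodes(numactl_output)
    return sorted(set(range(expected_count)) - present)


sample = "available: 6 nodes (1-3,5-7)\nnode 1 size: 16303 MB\n"
print(missing_nodes(sample, 8))  # prints [0, 4]
```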

Expected results:
Should be normal, as on RHEL 5.6:
[root@nec-em25 ~]#  numactl  --hardware
available: 8 nodes (0-7)
node 0 size: 16045 MB
node 0 free: 15346 MB
node 1 size: 16160 MB
node 1 free: 16114 MB
node 2 size: 16160 MB
node 2 free: 16127 MB
node 3 size: 16160 MB
node 3 free: 16134 MB
node 4 size: 16160 MB
node 4 free: 15973 MB
node 5 size: 16160 MB
node 5 free: 16134 MB
node 6 size: 16160 MB
node 6 free: 16133 MB
node 7 size: 16096 MB
node 7 free: 16058 MB
node distances:
node   0   1   2   3   4   5   6   7 
  0:  10  15  15  20  20  15  20  20 
  1:  15  10  20  15  20  20  20  20 
  2:  15  20  10  15  20  20  20  15 
  3:  20  15  15  10  15  20  20  20 
  4:  20  20  20  15  10  15  15  20 
  5:  15  20  20  20  15  10  20  15 
  6:  20  20  20  20  15  20  10  15 
  7:  20  20  15  20  20  15  15  10 


Additional info:
See attached log.

Comment 2 RHEL Program Management 2011-02-01 06:10:20 UTC
This request was evaluated by Red Hat Product Management for
inclusion in the current release of Red Hat Enterprise Linux.
Because the affected component is not scheduled to be updated
in the current release, Red Hat is unfortunately unable to
address this request at this time. Red Hat invites you to
ask your support representative to propose this request, if
appropriate and relevant, in the next release of Red Hat
Enterprise Linux. If you would like it considered as an
exception in the current release, please ask your support
representative.

Comment 3 RHEL Program Management 2011-02-01 19:04:58 UTC
This request was erroneously denied for the current release of
Red Hat Enterprise Linux.  The error has been fixed and this
request has been re-proposed for the current release.