Bug 496087 - FUJITSU drive seems not to encode reallocated secotrs in smart attr 5
Summary: FUJITSU drive seems not to encode reallocated secotrs in smart attr 5
Keywords:
Status: CLOSED RAWHIDE
Alias: None
Product: Fedora
Classification: Fedora
Component: libatasmart
Version: rawhide
Hardware: All
OS: Linux
low
medium
Target Milestone: ---
Assignee: Lennart Poettering
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
: 496153 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2009-04-16 14:55 UTC by Jiri Cerny
Modified: 2009-10-23 01:44 UTC (History)
6 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2009-04-22 23:40:10 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)
output of 'skdump --save=mysmartdata /dev/sda' (1.54 KB, application/octet-stream)
2009-04-16 20:34 UTC, Jiri Cerny
no flags Details

Description Jiri Cerny 2009-04-16 14:55:03 UTC
On the current rawhide palipsest reports bad sectors on my hard disk. In details window two attributes fails. When I try smartctl -a or smartctl -l on the same disk, it reports no error. So, I do not know whom to believe. 

Maybe one remark. I tried to read DK-disks sources and to come up with a small test program. I found that sk_disk_smart_get_bad function (which I think is used to check for bad blocks) returns -1 on my disk. Similarly,
sk_disk_smart_get_overall returns GOOD, which should mean no bad blocks. 

I attach the output of  devkit-disks --show-info /dev/sda and of smartctl -a /dev/sda. 


devkit-disks --show-info /dev/sda
Showing information for /org/freedesktop/DeviceKit/Disks/devices/sda
  native-path:             /sys/devices/pci0000:00/0000:00:1f.2/host0/target0:0:0/0:0:0:0/block/sda
  device:                  8:0
  device-file:             /dev/sda
    by-id:                 /dev/disk/by-id/ata-FUJITSU_MHY2120BH_K434T81257SL
    by-id:                 /dev/disk/by-id/scsi-SATA_FUJITSU_MHY2120_K434T81257SL
    by-path:               /dev/disk/by-path/pci-0000:00:1f.2-scsi-0:0:0:0
  system internal:         1
  removable:               0
  has media:               1
    detects change:        0
    detection by polling:  0
    detection inhibitable: 0
    detection inhibited:   0
  is read only:            0
  is mounted:              0
  mount paths:             
  mounted by uid:          0
  presentation name:       
  presentation icon:       
  size:                    120034123776
  block size:              512
  job underway:            no
  usage:                   
  type:                    
  version:                 
  uuid:                    
  label:                   
  partition table:
    scheme:                mbr
    count:                 9
  drive:
    vendor:                ATA
    model:                 FUJITSU MHY2120B
    revision:              0084
    serial:                K434T81257SL
    ejectable:             0
    require eject:         0
    media:                 
      compat:             
    interface:             ata
    if speed:              (unknown)
    ATA SMART:             Updated at Čt 16. duben 2009, 16:14:00 CEST
      assessment:          PASSED
      bad sectors:         Yes
      attributes:          One ore more attributes exceed threshold
      temperature:         41° C / 106° F
      powered on:          92 days
      offline data:        never collected (487 second(s) to complete)
      self-test status:    success or never (0% remaining)
      ext./short test:     available
      conveyance test:     available
      start test:          available
      abort test:          available
      short test:            2 minute(s) recommended polling time
      ext. test:            69 minute(s) recommended polling time
      conveyance test:       2 minute(s) recommended polling time
===============================================================================
 Attribute       Current/Worst/Threshold  Status   Value       Type     Updates
===============================================================================
 raw-read-error-rate         100/100/ 46   good    173680      Prefail  Online 
 throughput-performance      100/100/ 30   good    0           Prefail  Offline
 spin-up-time                100/100/ 25   good    1 msec      Prefail  Online 
 start-stop-count             99/ 99/  0    n/a    720         Old-age  Online 
 reallocated-sector-count    100/100/ 24   FAIL    8589934592000 sectors Prefail  Online 
 seek-error-rate             100/100/ 47   good    305         Prefail  Online 
 seek-time-performance       100/100/ 19   good    0           Prefail  Offline
 power-on-hours               96/ 96/  0    n/a    92 days     Old-age  Online 
 spin-retry-count            100/100/ 20   good    0           Prefail  Online 
 power-cycle-count           100/100/  0    n/a    634         Old-age  Online 
 power-off-retract-count     100/100/  0    n/a    14          Old-age  Online 
 load-cycle-count             98/ 98/  0    n/a    57867       Old-age  Online 
 temperature-celsius-2       100/100/  0    n/a    41C / 106F  Old-age  Online 
 hardware-ecc-recovered      100/100/  0    n/a    29          Old-age  Online 
 reallocated-event-count     100/100/  0   FAIL    459603968 sectors Old-age  Online 
 current-pending-sector      100/100/  0    n/a    0 sectors   Old-age  Online 
 offline-uncorrectable       100/100/  0    n/a    0 sectors   Old-age  Offline
 udma-crc-error-count        200/253/  0    n/a    0           Old-age  Online 
 multi-zone-error-rate       100/100/ 60   good    19949       Prefail  Online 
 run-out-cancel              100/100/  0    n/a    1533207445856 Old-age  Online 
 head-flying-hours           200/200/  0    n/a    0 msec      Old-age  Online 

=====

smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     FUJITSU MHY2120BH
Serial Number:    K434T81257SL
Firmware Version: 0084000D
User Capacity:    120 034 123 776 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 3f
Local Time is:    Thu Apr 16 16:31:10 2009 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		 ( 487) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  69) minutes.
Conveyance self-test routine
recommended polling time: 	 (   2) minutes.
SCT capabilities: 	       (0x003d)	SCT Status supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   046    Pre-fail  Always       -       159827
  2 Throughput_Performance  0x0005   100   100   030    Pre-fail  Offline      -       26804224
  3 Spin_Up_Time            0x0003   100   100   025    Pre-fail  Always       -       1
  4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       720
  5 Reallocated_Sector_Ct   0x0033   100   100   024    Pre-fail  Always       -       8589934592000
  7 Seek_Error_Rate         0x000f   100   100   047    Pre-fail  Always       -       3919
  8 Seek_Time_Performance   0x0005   100   100   019    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   096   096   000    Old_age   Always       -       2208
 10 Spin_Retry_Count        0x0013   100   100   020    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       634
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       14
193 Load_Cycle_Count        0x0032   098   098   000    Old_age   Always       -       57872
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       39 (Lifetime Min/Max 15/54)
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       47
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       459603968
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   253   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x000f   100   100   060    Pre-fail  Always       -       20758
203 Run_Out_Cancel          0x0002   100   100   000    Old_age   Always       -       433706435317
240 Head_Flying_Hours       0x003e   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      2196         -
# 2  Short offline       Completed without error       00%      2187         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Comment 1 David Zeuthen 2009-04-16 15:09:04 UTC
I think Lennart recently fixed a bug in libatasmart for this

 http://git.0pointer.de/?p=libatasmart.git;a=commitdiff;h=6afdd8b2c56c09ab6d20d8d53aeee5a2faec44e9

Reassigning to libatasmart since that's how we get this data.

Comment 2 Lennart Poettering 2009-04-16 18:06:50 UTC
Oh, and maybe I should note that the commit David mentioned does *not* fix the issue.

Comment 3 Lennart Poettering 2009-04-16 18:09:45 UTC
Jiri, could you please get me the raw SMART data from you drive? Please attach the file that 'skdump --save=mysmartdata /dev/sda' generates.

Comment 4 Jiri Cerny 2009-04-16 20:34:53 UTC
Created attachment 339917 [details]
output of 'skdump --save=mysmartdata /dev/sda' 

Lennart, here are the raw SMART data.

Comment 5 Lennart Poettering 2009-04-17 21:54:38 UTC
*** Bug 496153 has been marked as a duplicate of this bug. ***

Comment 6 Lennart Poettering 2009-04-22 23:40:10 UTC
Fixed in 0.12-2. libatasmart should now parse your smart data correctly (verfied on the blob you supplied). Will hopefully enter f11 soon.

Comment 7 Zack Cerza 2009-04-23 18:12:36 UTC
Thanks! I'll follow-up once I ge the update.

Comment 8 Zack Cerza 2009-04-23 19:13:27 UTC
Awesome. Looks to be fixed.

Comment 9 David 2009-08-22 15:19:07 UTC
I think it's the same bug, on Hitachi Travelstar 60G
Model HTS726060M9AT100 Rev 1.0.

Note* encrypted disk.

Report failing bad sectors.

# devkit-disks --show-info /dev/sda
Showing information for /org/freedesktop/DeviceKit/Disks/devices/sda
  native-path:             /sys/devices/pci0000:00/0000:00:1f.1/host0/target0:0:0/0:0:0:0/block/sda
  device:                  8:0
  device-file:             /dev/sda
    by-id:                 /dev/disk/by-id/ata-HTS726060M9AT00_MRH436M4J526KB
    by-id:                 /dev/disk/by-id/scsi-SATA_HTS726060M9AT00_MRH436M4J526KB
    by-path:               /dev/disk/by-path/pci-0000:00:1f.1-scsi-0:0:0:0
  detected at:             Sat 22 Aug 2009 09:06:05 AM EDT
  system internal:         1
  removable:               0
  has media:               1 (detected at Sat 22 Aug 2009 09:06:05 AM EDT)
    detects change:        0
    detection by polling:  0
    detection inhibitable: 0
    detection inhibited:   0
  is read only:            0
  is mounted:              0
  mount paths:             
  mounted by uid:          0
  presentation hide:       0
  presentation name:       
  presentation icon:       
  size:                    60011642880
  block size:              512
  job underway:            no
  usage:                   
  type:                    
  version:                 
  uuid:                    
  label:                   
  partition table:
    scheme:                mbr
    count:                 2
  drive:
    vendor:                ATA
    model:                 HTS726060M9AT00
    revision:              MH4O
    serial:                MRH436M4J526KB
    ejectable:             0
    require eject:         0
    media:                 
      compat:             
    interface:             ata
    if speed:              (unknown)
    ATA SMART:             Updated at Sat 22 Aug 2009 10:36:06 AM EDT
      assessment:          PASSED
      bad sectors:         Yes
      attributes:          One ore more attributes exceed threshold
      temperature:         42° C / 108° F
      powered on:          426 days
      offline data:        aborted (645 second(s) to complete)
      self-test status:    in progress (80% remaining)
      ext./short test:     available
      conveyance test:     not available
      start test:          available
      abort test:          available
      short test:            2 minute(s) recommended polling time
      ext. test:            39 minute(s) recommended polling time
      conveyance test:       0 minute(s) recommended polling time
===============================================================================
 Attribute       Current/Worst/Threshold  Status   Value       Type     Updates
===============================================================================
 raw-read-error-rate         100/100/ 62   good    65536       Prefail  Online 
 throughput-performance      100/100/ 40   good    0           Prefail  Offline
 spin-up-time                253/100/ 33   good    2 msec      Prefail  Online 
 start-stop-count             99/ 99/  0    n/a    2038        Old-age  Online 
 reallocated-sector-count    100/100/  5   FAIL    65541 sectors Prefail  Online 
 seek-error-rate             100/100/ 67   good    0           Prefail  Online 
 seek-time-performance       100/100/ 40   good    0           Prefail  Offline
 power-on-hours               77/ 77/  0    n/a    426 days    Old-age  Online 
 spin-retry-count            100/100/ 60   good    0           Prefail  Online 
 power-cycle-count           100/100/  0    n/a    1374        Old-age  Online 
 g-sense-error-rate          100/ 92/  0    n/a    0           Old-age  Online 
 power-off-retract-count     100/100/  0    n/a    96          Old-age  Online 
 load-cycle-count             60/ 60/  0    n/a    402060      Old-age  Online 
 temperature-celsius-2       130/ 93/  0    n/a    42C / 108F  Old-age  Online 
 reallocated-event-count     100/100/  0    n/a    1           Old-age  Online 
 current-pending-sector      100/100/  0    n/a    0 sectors   Old-age  Online 
 offline-uncorrectable       100/100/  0    n/a    0 sectors   Old-age  Offline
 udma-crc-error-count        200/200/  0    n/a    0           Old-age  Online 

smartctl -a /dev/sda
smartctl version 5.38 [i386-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Hitachi Travelstar 7K60
Device Model:     HTS726060M9AT00
Serial Number:    MRH436M4J526KB
Firmware Version: MH4OA6AA
User Capacity:    60,011,642,880 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  ATA/ATAPI-6 T13 1410D revision 3a
Local Time is:    Sat Aug 22 11:08:14 2009 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x85)	Offline data collection activity
					was aborted by an interrupting command from host.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      ( 242)	Self-test routine in progress...
					20% of test remaining.
Total time to complete Offline 
data collection: 		 ( 645) seconds.
Offline data collection
capabilities: 			 (0x5b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   4) minutes.
Extended self-test routine
recommended polling time: 	 (  41) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   062    Pre-fail  Always       -       1
  2 Throughput_Performance  0x0005   100   100   040    Pre-fail  Offline      -       1914
  3 Spin_Up_Time            0x0007   253   100   033    Pre-fail  Always       -       2
  4 Start_Stop_Count        0x0012   099   099   000    Old_age   Always       -       2038
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       65541
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   040    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0012   077   077   000    Old_age   Always       -       10230
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       1374
191 G-Sense_Error_Rate      0x000a   100   092   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       96
193 Load_Cycle_Count        0x0012   060   060   000    Old_age   Always       -       402060
194 Temperature_Celsius     0x0002   117   093   000    Old_age   Always       -       47 (Lifetime Min/Max 3/59)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       1
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     10228         -
# 2  Extended offline    Completed without error       00%     10205         -
# 3  Short offline       Completed without error       00%         2         -
# 4  Short offline       Completed without error       00%         2         -
# 5  Short offline       Completed without error       00%         1         -
# 6  Short offline       Completed without error       00%         1         -
# 7  Short offline       Completed without error       00%         1         -
# 8  Short offline       Completed without error       00%         0         -
# 9  Short offline       Completed without error       00%         0         -
#10  Short offline       Completed without error       00%         0         -

Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum.
SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Oops Fedora 11 beta, not rawhide.

Comment 10 Lennart Poettering 2009-08-24 14:24:52 UTC
David, your disk has 65541 reallocated sectors, so libatamsart is doing everything correctly.

Comment 11 D. Hugh Redelmeier 2009-10-23 01:44:07 UTC
Like David, I have a Hitachi notebook drive that has an apparently high Reallocated_Sector_Ct:

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       131092

The raw count (131092) seems very high.  I get a desktop warning (from F11, libatasmart-0.12-3.fc11.x86_64) that my disk is about to fail.  I'm not sure that this is accurate since the (cooked) value is 100, which is very good.

I notice that 131092 in hex would be 20014.  David's raw 65541 is 10005 in hex.  So I wonder if the raw number is not a simple integer count of reallocated sectors.  This suggests that the bottom 16-bits might be a separate counter from the rest of the number.

This drive is:
 Model Family:     Hitachi Travelstar 80GN family
 Device Model:     IC25N060ATMR04-0


Note You need to log in before you can comment on or make changes to this bug.