[Eisfair] smartmon with /dev/cciss/c0d0 no longer works

Jürgen Witt j-witt at web.de
Sat Dec 3 13:50:51 CET 2016


Hello Jürgen,

On 03.12.2016 at 12:27, Juergen Edner wrote:
> Hello Jürgen,
> 
> the check of the available devices detects an error 128,
> which prevents the configuration from being created.
> 
> /usr/sbin/smartctl -d cciss,0 -a /dev/cciss/c0d0; echo $?
> 
> Error 128 (bit 7) means "self-test log contains records
> of errors", i.e. your hard disk failed the long test
> and has errors. See also:

OK, thanks for the info.
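
For reference, the value printed by "echo $?" after smartctl is a bitmask, so it can be decoded bit by bit. The following is only a minimal sketch: the bit meanings are paraphrased from the EXIT STATUS section of the smartctl man page, and the function name decode_smartctl_status is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print one line per bit set in a smartctl exit status.
# Bit meanings are paraphrased from the smartctl man page (EXIT STATUS).
decode_smartctl_status() {
    status=$1
    if [ "$status" -eq 0 ]; then
        echo "no bits set"
        return 0
    fi
    bit=0
    for msg in \
        "command line did not parse" \
        "device open failed" \
        "a SMART or other command to the disk failed, or checksum error" \
        "SMART status check returned DISK FAILING" \
        "prefail attributes at or below threshold" \
        "attributes were at or below threshold in the past" \
        "device error log contains records of errors" \
        "self-test log contains records of errors"
    do
        # Test whether the current bit is set in the status value.
        if [ $(( (status >> bit) & 1 )) -eq 1 ]; then
            echo "bit $bit: $msg"
        fi
        bit=$((bit + 1))
    done
}
```

Called as "decode_smartctl_status 128", this prints "bit 7: self-test log contains records of errors", which matches the explanation quoted above.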

I was in fact alerted to the error by an email from the system.

This email was generated by the smartd daemon running on:

   host name: eisfair
  DNS domain: lan.home
  NIS domain: (none)

The following warning/error was logged by the smartd daemon:

Device: /dev/cciss/c0d0 [cciss_disk_00], Self-Test Log error count
increased from 0 to 1

For details see host's SYSLOG (default: /var/log/messages).

You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.
-------------------------------------------------------
The following partitions are affected:
Device /dev/cciss/c0d0 [cciss_disk_00]:
-------------------------------------------------------

The EIS/FAIR S.M.A.R.T. Daemon

What I don't understand, though, is why the configuration is no longer
created because of this and why I can no longer view the SMART values.
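
To illustrate the distinction in question: a device check could reject a disk only when smartctl was unable to talk to it at all, and still accept it when the status merely reports disk problems. This is a sketch under assumptions, not the check the smartmon package actually performs (that is not known here). The helper name smartctl_output_usable and the choice of treating only bits 0 and 1 as fatal are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: succeed unless smartctl could not query the device.
# Assumption: only bit 0 (value 1, command line did not parse) and
# bit 1 (value 2, device open failed) mean "no usable output";
# higher bits such as bit 7 (value 128) only describe the disk's state.
smartctl_output_usable() {
    [ $(( $1 & 3 )) -eq 0 ]
}

# Usage sketch with the command from this thread:
# /usr/sbin/smartctl -d cciss,0 -a /dev/cciss/c0d0 >/dev/null
# if smartctl_output_usable $?; then echo "device can be monitored"; fi
```

With such a mask, a status of 128 would still count as usable, while a status of 2 would not.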

For example, I have another server at a customer's site with a
software RAID 5 made of 3 ordinary SATA disks. There, too, one of the 3
RAID disks is flagged, but the configuration is created normally and I
can view the SMART values of every device.

Shortly after saving the configuration I merely get a system email
with this content:


This email was generated by the smartd daemon running on:

   host name: eis
  DNS domain: lan.home
  NIS domain: (none)

The following warning/error was logged by the smartd daemon:

Device: /dev/sdd, 1 Offline uncorrectable sectors


For details see host's SYSLOG (default: /var/log/messages).

You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.
-------------------------------------------------------
The following partitions are affected:
Device /dev/sdd is part of following software-raid(s)
 - /dev/md4 mounted on /data
 - /dev/md3 mounted on /
 - /dev/md2 mounted on
 - /dev/md1 mounted on /boot
-------------------------------------------------------

The EIS/FAIR S.M.A.R.T. Daemon

I can also view the SMART values there.

eis # smartctl -l selftest /dev/sdd
smartctl 5.39 2009-12-09 r2995 [i686-pc-linux-gnu] (local build)
Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       20%     64419         712048213
# 2  Short offline       Completed without error       00%     64416         -
# 3  Short offline       Completed without error       00%     64392         -
# 4  Short offline       Completed without error       00%     64368         -
# 5  Short offline       Completed without error       00%     64344         -
# 6  Short offline       Completed without error       00%     64320         -
# 7  Short offline       Completed without error       00%     64296         -
# 8  Short offline       Completed without error       00%     64272         -
# 9  Extended offline    Completed: read failure       20%     64251         712048214
#10  Short offline       Completed without error       00%     64248         -
#11  Short offline       Completed without error       00%     64224         -
#12  Short offline       Completed without error       00%     64200         -
#13  Short offline       Completed without error       00%     64177         -
#14  Short offline       Completed without error       00%     64153         -
#15  Short offline       Completed without error       00%     64129         -
#16  Short offline       Completed without error       00%     64105         -
#17  Extended offline    Completed: read failure       20%     64083         712048213
#18  Short offline       Completed without error       00%     64081         -
#19  Short offline       Completed without error       00%     64057         -
#20  Short offline       Completed without error       00%     64033         -
#21  Short offline       Completed without error       00%     64009         -

or this as well:

******************************************************************************************

 Short report for drive '/dev/sdd'

******************************************************************************************
smartctl 5.39 2009-12-09 r2995 [i686-pc-linux-gnu] (local build)
Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net



=== START OF READ SMART DATA SECTION ===

SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   099   051    Pre-fail  Always       -       4
  3 Spin_Up_Time            0x0007   084   084   011    Pre-fail  Always       -       5640
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       25
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   100   100   051    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0025   100   100   015    Pre-fail  Offline      -       11058
  9 Power_On_Hours          0x0032   087   087   000    Old_age   Always       -       64428
 10 Spin_Retry_Count        0x0033   100   100   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0012   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       25
 13 Read_Soft_Error_Rate    0x000e   100   099   000    Old_age   Always       -       4
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0033   100   100   000    Pre-fail  Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       60
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   077   070   000    Old_age   Always       -       23 (Lifetime Min/Max 20/29)
194 Temperature_Celsius     0x0022   077   068   000    Old_age   Always       -       23 (Lifetime Min/Max 19/31)
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       288756034
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       1
199 UDMA_CRC_Error_Count    0x003e   100   100   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
201 Soft_Read_Error_Rate    0x000a   100   100   000    Old_age   Always       -       0

Regards
Jürgen


