Daniel Podolsky (nil59) wrote in ru_root,
Daniel Podolsky
nil59
ru_root

[SOLVED] Slow disk producing hi LA

Похоже, помирает диск
197 Current_Pending_Sector 0x0032 189 189 000 Old_age Always - 1188

У близнеца его этот параметр - 0

День добрый!

Туплю, не могу понять, что еще посмотреть, помогите, пожалуйста :)

Имею сервер вот с таким load average: 3.20, 5.31, 3.20

Это сейчас, а с утра я его перегружал с la 240. но и 3.2 - это очень много, там активных процессов ровно 1

atop говорит мне красным по черному
DSK | sda | busy 101% | read 5 | write 26 | avio 322 ms |

то есть - проблема в диске.

Но в SMART я ничего страшного не вижу.

Порт менял, кабель менял.

Куда бы еще поглядеть?

update: похоже, до больше не реагирует адекватно ни на pre, ни на pre ни на code :(
Извините за форматирование, сделать ничего не могу
глюк исчез



Вывод atop
ATOP - fServ              2011/07/21  11:57:48               10 seconds elapsed
PRC | sys   2.55s | user   7.94s | #proc    297 | #zombie    1 | #exit    134 |
CPU | sys     25% | user     80% | irq       0% | idle    151% | wait    143% |
cpu | sys     13% | user     44% | irq       0% | idle      0% | cpu002 w 43% |
cpu | sys     10% | user     34% | irq       0% | idle      3% | cpu000 w 53% |
cpu | sys      2% | user      1% | irq       0% | idle     69% | cpu003 w 28% |
cpu | sys      1% | user      1% | irq       0% | idle     78% | cpu001 w 20% |
CPL | avg1   5.82 | avg5    5.20 | avg15   2.42 | csw    31221 | intr   35880 |
MEM | tot    7.7G | free    6.8G | cache 213.8M | buff    3.1M | slab   37.4M |
SWP | tot    0.0M | free    0.0M |              | vmcom   2.4G | vmlim   3.9G |
DSK |         sda | busy    101% | read       5 | write     26 | avio  322 ms |
DSK |         sdb | busy      2% | read       9 | write     15 | avio    7 ms |
NET | transport   | tcpi      10 | tcpo      10 | udpi      10 | udpo       4 |
NET | network     | ipi       46 | ipo       14 | ipfrw      0 | deliv     16 |
NET | vnet2    0% | pcki       2 | pcko      36 | si    0 Kbps | so    2 Kbps |
NET | vnet0    0% | pcki       1 | pcko      36 | si    0 Kbps | so    2 Kbps |
NET | em1      0% | pcki      67 | pcko      14 | si    7 Kbps | so    1 Kbps |
NET | br2    ---- | pcki      44 | pcko      12 | si    2 Kbps | so    1 Kbps |
NET | virbr0 ---- | pcki       0 | pcko       3 | si    0 Kbps | so    0 Kbps |

  PID  SYSCPU  USRCPU  VGROW  RGROW  RDDSK  WRDSK  ST EXC S  DSK CMD     1/9   
 3326   0.06s   0.06s     0K   264K   540K    20K  --   - S  96% qemu-kvm


uname -a
Linux fServ.dpHome.djarvur.net 2.6.38.8-35.fc15.x86_64 #1 SMP Wed Jul 6 13:58:54 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux


cat /etc/redhat-release 
Fedora release 15 (Lovelock)


cat /proc/mdstat 
Personalities : [raid1] 
md127 : active raid1 sda1[1] sdb1[0]
      625128121 blocks super 1.2 [2/2] [UU]
      
unused devices: 

Со вторым диском, прошу заметить, все в порядке

sudo fdisk -l /dev/sda

Disk /dev/sda: 640.1 GB, 640135028736 bytes
255 heads, 63 sectors/track, 77825 cylinders, total 1250263728 sectors
Units = sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xc3a47544

   Device Boot      Start         End      Blocks   Id  System
/dev/sda1              63  1250258624   625129281   fd  Linux raid autodetect


mount|grep md127
/dev/md127 on /storage type ext4 (rw,relatime,barrier=1,data=ordered)


Ну и наконец
sudo smartctl -a /dev/sda
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-2.6.38.8-35.fc15.x86_64] (local build)
Copyright (C) 2002-11 by Bruce Allen, hprep://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Caviar Green (Adv. Format)
Device Model:     WDC WD6400AARS-00Y5B1
Serial Number:    WD-WCAV56889179
LU WWN Device Id: 5 0014ee 203e8d675
Firmware Version: 80.00A80
User Capacity:    640,135,028,736 bytes [640 GB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Thu Jul 21 12:12:55 2011 MSK
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (12660) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 148) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x3031) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Apreributes Data Structure revision number: 16
Vendor Specific SMART Apreributes with Thresholds:
ID# ApreRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   128   128   021    Pre-fail  Always       -       6558
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       149
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   084   084   000    Old_age   Always       -       12296
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       147
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       98
193 Load_Cycle_Count        0x0032   135   135   000    Old_age   Always       -       197121
194 Temperature_Celsius     0x0022   098   095   000    Old_age   Always       -       49
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   189   189   000    Old_age   Always       -       1188
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

Subscribe

  • Windows server 2016 RDS с перемещенными профилями

    Коллеги, на терминальном сервере настроен параметр внешнего хранения профилей пользователей. Какое-то время после запуска сервера, всё идет…

  • про ФЗ-152

    Коллеги, а есть ли в природе проверенный сервис, который позволяет: 1. получить виртуалку/виртуалки в windows, чтобы в ней / в них могли работать…

  • не резолвятся имена в DNS за VPN

    Приветствую. Настраиваю VPN, чтобы засунуть во внутреннюю сеть часть сервисов. Проблема в следующем — не работает резолвинг адресов вида…

  • Post a new comment

    Error

    Anonymous comments are disabled in this journal

    default userpic

    Your reply will be screened

    Your IP address will be recorded 

  • 6 comments