诡异的硬盘问题

G. 发布于 2010/09/17 18:24
阅读 747
收藏 1

系统: CentOS 5

分区: ext3 (全盘只有一个分区)

硬盘: ST3250310SV

接口: SATA

挂载: /dev/sdb

已使用空间: 6%

 

问题:

    读取数据正常,数据正确;  写数据很慢,偶尔卡死;  删除数据,卡死!

    删除数据时: 用 iostat -x 1  查看, util 达到 100%,居高不下, 删除一个文件(普通小文件)可能需要半个小时.

    cat /var/log/messages | grep error  没有错误.

    smartctl -l error /dev/sdb   一个小时都没有出来结果...

 

请教大家, 可能是干什么原因导致这样的问题.

加载中
0
红薯
红薯

如果机器IO完全没什么负载的话,我估计这块硬盘要寿终正寝了

0
G.
G.

这块硬盘主要存放 web 的数据和 apache log ,属于读多写少.

之前没有监控硬盘的负载, 并不知道用得多厉害.

老大,不要吓我, 换硬盘很郁闷的!

诡异的是,为什么是卡死要"删除"上面呢?

0
红薯
红薯

真的很有可能是硬盘问题,我觉得你不妨未雨绸缪,要不等真正坏了,哭都没有眼泪

0
G.
G.

啊!...

郁闷!

0
G.
G.

# smartctl -l error /dev/sdb
smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
ATA Error Count: 14813 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:23.503  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:23.503  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT

Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14811 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14810 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
  27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14809 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
  27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:17.981  READ NATIVE MAX ADDRESS EXT

0
G.
G.

Error 14809

后面的数字是错误的 次数 吗?

0
红薯
红薯

ATA Error Count: 14813 肯定是错误啊

这块硬盘真的不能要了,赶紧备份数据吧。

0
G.
G.

ATA Error Count: 14813

0
auchan
auchan

错误14813次?

0
G.
G.

天啊...三个硬盘坏了两个....

这种硬盘拿回来给PC用, 不知道还能不能用?

比如重新低格之类的.

返回顶部
顶部