11
回答
诡异的硬盘问题
利用AWS快速构建适用于生产的无服务器应用程序,免费试用12个月>>>   

系统: CentOS 5

分区: ext3 (全盘只有一个分区)

硬盘: ST3250310SV

接口: SATA

挂载: /dev/sdb

已使用空间: 6%

 

问题:

    读取数据正常,数据正确;  写数据很慢,偶尔卡死;  删除数据,卡死!

    删除数据时: 用 iostat -x 1  查看, util 达到 100%,居高不下, 删除一个文件(普通小文件)可能需要半个小时.

    cat /var/log/messages | grep error  没有错误.

    smartctl -l error /dev/sdb   一个小时都没有出来结果...

 

请教大家, 可能是干什么原因导致这样的问题.

<无标签>
举报
G.
发帖于7年前 11回/717阅
共有11个答案 最后回答: 7年前

这块硬盘主要存放 web 的数据和 apache log ,属于读多写少.

之前没有监控硬盘的负载, 并不知道用得多厉害.

老大,不要吓我, 换硬盘很郁闷的!

诡异的是,为什么是卡死要"删除"上面呢?

# smartctl -l error /dev/sdb
smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
ATA Error Count: 14813 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:23.503  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:23.503  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT

Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14811 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
  27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14810 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
  27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

Error 14809 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
  27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
  ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
  27 00 00 00 00 00 e0 00      02:17:17.981  READ NATIVE MAX ADDRESS EXT

顶部