And this problem is an extremely unpleasant one: hangs.
The computer has hung twice today, both times during a backup. (The backup system itself runs on another machine; on the machine that hung, only rsync was running, to transfer the files being backed up.)
The first time, it hung an hour after the backup started, and the logs were empty (apart from the line syslog-ng[2546]: STATS: dropped 0, which I don't think is relevant). I decided to put the problem off until the weekend and carry on with the backup in the meantime. Six hours after I restarted the backup, the computer hung again, and this time I was working at it when it happened.
First there were terrible slowdowns, Windows-style, for a few seconds; then they passed, I went on working, and a minute later the system froze solid. This time something was left in the log:
Code:
Apr 9 23:47:43 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:43 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:43 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:47:43 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:47:43 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:47:43 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:47:43 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:47:43 DIMM kernel: ata2: EH complete
Apr 9 23:47:45 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:45 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:45 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:47:45 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:47:45 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:47:55 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:47:55 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:47:55 DIMM kernel: ata2: EH complete
Apr 9 23:47:55 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:55 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:55 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:47:55 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:47:55 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:47:55 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:47:55 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:47:55 DIMM kernel: ata2: EH complete
Apr 9 23:47:55 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:55 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:55 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:47:55 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:47:55 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:47:55 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:47:55 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:47:55 DIMM kernel: ata2: EH complete
Apr 9 23:47:55 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:55 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:55 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:47:55 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:47:55 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:47:55 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:47:55 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:47:55 DIMM kernel: ata2: EH complete
Apr 9 23:47:55 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:47:55 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:47:55 DIMM kernel: ata2.00: cmd 25/00:00:fe:36:4b/00:02:02:00:00/e0 tag 0 dma 262144 in
Apr 9 23:48:12 DIMM slapd[4147]: conn=27 fd=18 closed (connection lost)
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM slapd[4147]: conn=26 fd=17 closed (connection lost)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE,SUGGEST_OK
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Sense Key : Medium Error [current] [descriptor]
Apr 9 23:48:23 DIMM kernel: Descriptor sense data with sense descriptors (in hex):
Apr 9 23:48:23 DIMM kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
Apr 9 23:48:23 DIMM kernel: 02 4b 37 78
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Add. Sense: Unrecovered read error - auto reallocate failed
Apr 9 23:48:23 DIMM kernel: end_request: I/O error, dev sdb, sector 38483832
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB)
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write Protect is off
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB)
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write Protect is off
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE,SUGGEST_OK
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Sense Key : Medium Error [current] [descriptor]
Apr 9 23:48:23 DIMM kernel: Descriptor sense data with sense descriptors (in hex):
Apr 9 23:48:23 DIMM kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
Apr 9 23:48:23 DIMM kernel: 02 4b 37 78
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Add. Sense: Unrecovered read error - auto reallocate failed
Apr 9 23:48:23 DIMM kernel: end_request: I/O error, dev sdb, sector 38483832
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB)
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write Protect is off
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB)
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write Protect is off
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Apr 9 23:48:23 DIMM kernel: ata2.00: BMDMA stat 0x4
Apr 9 23:48:23 DIMM kernel: ata2.00: cmd c8/00:08:76:37:4b/00:00:00:00:00/e2 tag 0 dma 4096 in
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: ata2.00: status: { DRDY ERR }
Apr 9 23:48:23 DIMM kernel: ata2.00: error: { UNC }
Apr 9 23:48:23 DIMM kernel: ata2.00: configured for UDMA/133
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE,SUGGEST_OK
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Sense Key : Medium Error [current] [descriptor]
Apr 9 23:48:23 DIMM kernel: Descriptor sense data with sense descriptors (in hex):
Apr 9 23:48:23 DIMM kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
Apr 9 23:48:23 DIMM kernel: 02 4b 37 78
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Add. Sense: Unrecovered read error - auto reallocate failed
Apr 9 23:48:23 DIMM kernel: end_request: I/O error, dev sdb, sector 38483832
Apr 9 23:48:23 DIMM kernel: ata2: EH complete
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB)
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write Protect is off
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Apr 9 23:48:23 DIMM kernel: sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Apr 9 23:50:39 DIMM syslog-ng[2546]: STATS: dropped 0
As the log shows, messages about a hard disk error did indeed appear first, and a couple of minutes later the computer hung (both hangs happened at about the time the line "syslog-ng[2546]: STATS: dropped 0" appeared in the log).
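To check whether every error in a log like this points at the same place on the disk, the log can be summarized with a short script. The following is only a sketch: the function name `summarize` and the `SAMPLE` string are made up for illustration, and the sample contains just a few lines copied from the log above rather than the whole file.

```python
import re
from collections import Counter

# A few lines copied from the log above; in practice, read the real log file.
SAMPLE = """\
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e0 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: end_request: I/O error, dev sdb, sector 38483832
Apr 9 23:48:23 DIMM kernel: res 51/40:00:78:37:4b/40:00:02:00:00/e2 Emask 0x9 (media error)
Apr 9 23:48:23 DIMM kernel: end_request: I/O error, dev sdb, sector 38483832
"""

# Matches the kernel's "I/O error, dev <device>, sector <number>" message.
IO_ERROR = re.compile(r"I/O error, dev (\w+), sector (\d+)")


def summarize(log_text):
    """Return (number of media-error lines, Counter keyed by (device, sector))."""
    media_errors = 0
    failed = Counter()
    for line in log_text.splitlines():
        if "(media error)" in line:
            media_errors += 1
        match = IO_ERROR.search(line)
        if match:
            failed[(match.group(1), int(match.group(2)))] += 1
    return media_errors, failed


if __name__ == "__main__":
    media_errors, failed = summarize(SAMPLE)
    print("media error lines:", media_errors)
    for (device, sector), count in failed.items():
        print(f"{device} sector {sector}: {count} I/O error(s)")
```

If the full log yields a single (device, sector) pair, as the excerpt here does, the retries are all hitting one unreadable sector rather than errors scattered across the disk.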
Personally, I conclude from all this that the problem is in the disk subsystem.
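One detail in the log fits this conclusion: the sector number in the "I/O error" line can be recovered independently from two other places in the same log, the "res" line and the sense data bytes. The sketch below decodes both. The function names are hypothetical, and the field order assumed for the "res" line (status/error:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device) is my understanding of how libata prints it, not something stated in the log itself.

```python
def lba_from_sense(info_hex):
    """Decode the 8-byte information field of descriptor sense data (big-endian)."""
    return int.from_bytes(bytes.fromhex(info_hex), "big")


def lba_from_res(res):
    """Decode the 48-bit LBA from a libata 'res' field.

    Assumed layout (not confirmed by the log itself):
    status/error:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
    """
    parts = res.split("/")
    low = parts[1].split(":")    # error, nsect, lbal, lbam, lbah
    high = parts[2].split(":")   # hob_feature, hob_nsect, hob_lbal, hob_lbam, hob_lbah
    # Most significant byte first: hob_lbah, hob_lbam, hob_lbal, lbah, lbam, lbal
    ordered = [high[4], high[3], high[2], low[4], low[3], low[2]]
    return int("".join(ordered), 16)


if __name__ == "__main__":
    # Last four bytes of the first sense line plus the four bytes of the second.
    print(lba_from_sense("00 00 00 00 02 4b 37 78"))
    # The 'res' field exactly as it appears in the log.
    print(lba_from_res("51/40:00:78:37:4b/40:00:02:00:00/e0"))
```

Both calls give 38483832, the same sector the kernel reports in "end_request: I/O error, dev sdb, sector 38483832", so the drive itself is reporting that one sector as unreadable.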
Another point in favor of this: I have loaded both the CPU and the RAM quite often without ever having problems, while the hard disk had not seen serious load for a long time. Now it has, in the form of the backup, and that is what led to the hangs.
Still, the situation is not entirely clear to me: I don't quite understand what these errors are, what they mean, or what I should do next.
So I hope someone knowledgeable can help with advice. Thanks in advance.