Kernel log reports I/O error

Hi,

I noticed today that my kernel log file is growing abnormally fast.
I took a look inside the log and found that the following messages appear constantly.
It looks like I have a serious problem with my SSD.
But this volume is virtual: my UTM runs under ESXi 6.5.

2017:05:16-19:26:10 wolfgatefw kernel: [ 1639.107644] EXT4-fs (sda8): discard request in group:1 block:1647 count:2 failed with -5
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326141] sd 0:0:0:0: [sda] Unhandled sense code
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326148] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326151] Result: hostbyte=0x00 driverbyte=0x08
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326154] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326156] Sense Key : 0x9 [current]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326160] Info fld=0x0
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326163] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326165] ASC=0x0 ASCQ=0x0
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326169] sd 0:0:0:0: [sda] CDB:
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326171] cdb[0]=0x42: 42 00 00 00 00 00 00 00 18 00
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326180] end_request: I/O error, dev sda, sector 56868858
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326194] EXT4-fs (sda6): discard request in group:19 block:18687 count:2 failed with -5
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326241] sd 0:0:0:0: [sda] Unhandled sense code
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326245] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326247] Result: hostbyte=0x00 driverbyte=0x08
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326250] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326252] Sense Key : 0x9 [current]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326255] Info fld=0x0
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326258] sd 0:0:0:0: [sda]
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326260] ASC=0x0 ASCQ=0x0
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326263] sd 0:0:0:0: [sda] CDB:
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326265] cdb[0]=0x42: 42 00 00 00 00 00 00 00 18 00
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326274] end_request: I/O error, dev sda, sector 56868898
...
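
A note on reading the excerpt above: every failing command carries opcode 0x42, which is UNMAP in the SCSI block command set, and -5 is the kernel's EIO. Sense key 0x9 is the "vendor specific" key. So the failures shown here appear to be rejected discard requests rather than failed reads or writes. The following is only a minimal sketch for tallying such lines; the sample lines are copied from the log above, while the function name `summarise` and the small opcode table are assumptions made for illustration.

```python
import errno
import re
from collections import Counter

# Sample lines copied from the kernel log above.
LOG = """\
2017:05:16-19:26:10 wolfgatefw kernel: [ 1639.107644] EXT4-fs (sda8): discard request in group:1 block:1647 count:2 failed with -5
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326171] cdb[0]=0x42: 42 00 00 00 00 00 00 00 18 00
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326194] EXT4-fs (sda6): discard request in group:19 block:18687 count:2 failed with -5
2017:05:16-19:26:12 wolfgatefw kernel: [ 1641.326265] cdb[0]=0x42: 42 00 00 00 00 00 00 00 18 00
"""

DISCARD_RE = re.compile(r"EXT4-fs \((\w+)\): discard request .* failed with -(\d+)")
CDB_RE = re.compile(r"cdb\[0\]=0x([0-9a-fA-F]{2})")

# Hypothetical lookup table: only the opcode seen in this log is listed.
SCSI_OPCODES = {0x42: "UNMAP"}


def summarise(log_text):
    """Count failed ext4 discards per (partition, errno name) and SCSI opcodes."""
    discards = Counter()
    opcodes = Counter()
    for line in log_text.splitlines():
        m = DISCARD_RE.search(line)
        if m:
            # Translate the numeric error (5) into its symbolic name (EIO).
            err_name = errno.errorcode.get(int(m.group(2)), m.group(2))
            discards[(m.group(1), err_name)] += 1
        m = CDB_RE.search(line)
        if m:
            code = int(m.group(1), 16)
            opcodes[SCSI_OPCODES.get(code, hex(code))] += 1
    return discards, opcodes


if __name__ == "__main__":
    discards, opcodes = summarise(LOG)
    for (partition, err), n in sorted(discards.items()):
        print(partition, err, n)
    for name, n in sorted(opcodes.items()):
        print(name, n)
```

If the counts for a real log are dominated by UNMAP and discard failures like this, it may be worth checking whether the filesystems are mounted with the `discard` option before suspecting the SSD itself.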

I then connected to my VMware shell and ran the command that retrieves the SMART information from my SSD.
It is difficult for me to conclude that my SSD really has a problem, since each value is far above its corresponding threshold.
Or maybe I'm not looking in the right place, or I'm not interpreting those values correctly?

[root@localhost:~] esxcli storage core device smart get -d t10.ATA_____SanDisk_SDSSDHII240G____________________143616400412________
Parameter                     Value  Threshold  Worst
----------------------------  -----  ---------  -----
Health Status                 OK     N/A        N/A
Media Wearout Indicator       100    0          100
Write Error Count             N/A    N/A        N/A
Read Error Count              N/A    N/A        N/A
Power-on Hours                253    0          100
Power Cycle Count             100    0          100
Reallocated Sector Count      100    0          100
Raw Read Error Rate           N/A    N/A        N/A
Drive Temperature             64     0          49
Driver Rated Max Temperature  N/A    N/A        N/A
Write Sectors TOT Count       253    0          253
Read Sectors TOT Count        253    0          253
Initial Bad Block Count       N/A    N/A        N/A
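
The comparison of values against thresholds can be written down as a small check. This is a sketch only, assuming the usual SMART convention that a normalized value at or below its threshold marks a failing attribute; whether every row reported by esxcli follows that convention is an assumption. The rows are copied from the output above, and the function name `failing_attributes` is made up for illustration.

```python
# Numeric rows copied from the esxcli output above: (parameter, value, threshold, worst).
SMART_ROWS = [
    ("Health Status", "OK", "N/A", "N/A"),
    ("Media Wearout Indicator", "100", "0", "100"),
    ("Power-on Hours", "253", "0", "100"),
    ("Power Cycle Count", "100", "0", "100"),
    ("Reallocated Sector Count", "100", "0", "100"),
    ("Drive Temperature", "64", "0", "49"),
    ("Write Sectors TOT Count", "253", "0", "253"),
    ("Read Sectors TOT Count", "253", "0", "253"),
]


def failing_attributes(rows):
    """Return the names of attributes whose value is at or below the threshold."""
    flagged = []
    for name, value, threshold, _worst in rows:
        # Skip rows such as "OK" or "N/A" that carry no numeric value.
        if not (value.isdigit() and threshold.isdigit()):
            continue
        if int(value) <= int(threshold):
            flagged.append(name)
    return flagged


if __name__ == "__main__":
    print(failing_attributes(SMART_ROWS))
```

With the table above nothing is flagged, which matches the reading that SMART itself reports no failing attribute. Note that a threshold of 0 can never be reached by a positive value, so those rows are effectively informational.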

Thank you for any help,

Fred



This thread was automatically locked due to age.
  • I would be tempted to create a new VM from ISO and then restore a configuration backup.  Any luck with that?

    Cheers - Bob

     
    Sophos UTM Community Moderator
    Sophos Certified Architect - UTM
    Sophos Certified Engineer - XG
    Gold Solution Partner since 2005
    MediaSoft, Inc. USA
  • Hello,

    I've installed a fresh ESXi 6.5 with Sophos UTM 9.500-9 on a brand-new SSD. The problem reappeared.

    I also did a complete reinstallation on a regular SATA hard drive, and this time the problem was gone!

    There is clearly a relationship between the SSD and the flood of strange errors reported in the kernel log.

    ESXi recognized the disk controllers natively. My SSD is plugged into vmhba0:

    [root@localhost:~] lspci -v | grep "Class 0106" -B 1
    0000:00:17.0 SATA controller Mass storage controller: Intel Corporation Avoton AHCI Controller [vmhba0]  Class 0106: 8086:1f22
    0000:00:18.0 SATA controller Mass storage controller: Intel Corporation Avoton AHCI Controller [vmhba1] Class 0106: 8086:1f32
    0000:04:00.0 SATA controller Mass storage controller: Marvell Technology Group Ltd. 88SE9172 SATA 6Gb/s Controller [vmhba2] Class 0106: 1b4b:9172
    0000:09:00.0 SATA controller Mass storage controller: Marvell Technology Group Ltd. 88SE9230 PCIe SATA 6Gb/s Controller [vmhba3] Class 0106: 1b4b:9230

     

    I also ran into slowness with my SSD under ESXi, but I was able to solve that by applying this solution:

    https://communities.vmware.com/thread/554004

     

    Now I've run out of ideas...
