Läs & skriv på en zpool = panic (FreeBSD som kvmgäst)

Permalänk
Medlem

Läs & skriv på en zpool = panic (FreeBSD som kvmgäst)

Hej alla,

Har en maskin som kör Debian Squeeze (amd64) med kvm. En av kvmgästerna kör FreeBSD 8.2-RELEASE (amd64).

FreeBSD-gästen har 5GB tilldelat minne, systemdisken som en raw-image samt 6st tilldelade diskar.

När jag både läser och skriver från min zpool (t ex, skriver från en dator över nfs och läser från en annan - också över nfs) får jag en panic som ser ut såhär:

May 14 03:04:00 filserver syslogd: kernel boot file is /boot/kernel/kernel May 14 03:04:00 filserver kernel: May 14 03:04:00 filserver kernel: = DPL 0, pres 1, long 1, def32 0, gran 1 May 14 03:04:00 filserver kernel: processor eflags = interrupt enabled, resume, IOPL = 0 May 14 03:04:00 filserver kernel: current process = 1111 (find) May 14 03:04:00 filserver kernel: trap number = 12 May 14 03:04:00 filserver kernel: panic: page fault May 14 03:04:00 filserver kernel: cpuid = 0 May 14 03:04:00 filserver kernel: KDB: stack backtrace: May 14 03:04:00 filserver kernel: #0 0xffffffff805f4e0e at kdb_backtrace+0x5e May 14 03:04:00 filserver kernel: #1 0xffffffff805c2d07 at panic+0x187 May 14 03:04:00 filserver kernel: #2 0xffffffff808ac600 at trap_fatal+0x290 May 14 03:04:00 filserver kernel: #3 0xffffffff808ac9df at trap_pfault+0x28f May 14 03:04:00 filserver kernel: #4 0xffffffff808acebf at trap+0x3df May 14 03:04:00 filserver kernel: #5 0xffffffff80894fb4 at calltrap+0x8 May 14 03:04:00 filserver kernel: #6 0xffffffff80e809c4 at zap_cursor_retrieve+0x194 May 14 03:04:00 filserver kernel: #7 0xffffffff80ea3fe6 at zfs_freebsd_readdir+0x2b6 May 14 03:04:00 filserver kernel: #8 0xffffffff8065c7a7 at kern_getdirentries+0x217 May 14 03:04:00 filserver kernel: #9 0xffffffff8065ca53 at getdirentries+0x23 May 14 03:04:00 filserver kernel: #10 0xffffffff80600dd5 at syscallenter+0x1e5 May 14 03:04:00 filserver kernel: #11 0xffffffff808aca5b at syscall+0x4b May 14 03:04:00 filserver kernel: #12 0xffffffff80895292 at Xfast_syscall+0xe2 May 14 03:04:00 filserver kernel: Uptime: 45m11s May 14 03:04:00 filserver kernel: Cannot dump. Device not defined or unavailable. May 14 03:04:00 filserver kernel: Automatic reboot in 15 seconds - press a key on the console to abort May 14 03:04:00 filserver kernel: Rebooting...

Vet inte om det är relaterat, men innan panic:en får jag även en del såna här felmeddelanden:

May 14 03:02:50 filserver kernel: ed0: NIC memory corrupt - invalid packet length 4699 May 14 03:02:50 filserver kernel: ed0: ed_stop_hw RST never set

libvirt-definitionen av kvm-gästen ser ut såhär:

<domain type='kvm' id='4'> <name>fileserver</name> <uuid>0942d0bc-5b60-cc88-b5e7-a03898029527</uuid> <memory>5242880</memory> <currentMemory>5242880</currentMemory> <vcpu>1</vcpu> <os> <type arch='x86_64' machine='pc-0.12'>hvm</type> <boot dev='hd'/> </os> <features> <acpi/> <apic/> <pae/> </features> <clock offset='utc'/> <on_poweroff>destroy</on_poweroff> <on_reboot>restart</on_reboot> <on_crash>restart</on_crash> <devices> <emulator>/usr/bin/kvm</emulator> <disk type='file' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source file='/kvm/images/fileserver.img'/> <target dev='hda' bus='ide'/> <alias name='ide0-0-0'/> <address type='drive' controller='0' bus='0' unit='0'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGR1ZYS'/> <target dev='sdc' bus='scsi'/> <alias name='scsi0-0-2'/> <address type='drive' controller='0' bus='0' unit='2'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGR21HS'/> <target dev='sdd' bus='scsi'/> <alias name='scsi0-0-3'/> <address type='drive' controller='0' bus='0' unit='3'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGRXSDS'/> <target dev='sde' bus='scsi'/> <alias name='scsi0-0-4'/> <address type='drive' controller='0' bus='0' unit='4'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0265752'/> <target dev='sdf' bus='scsi'/> <alias name='scsi0-0-5'/> <address type='drive' controller='0' bus='0' unit='5'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0159792'/> <target dev='sdh' bus='scsi'/> <alias name='scsi1-0-0'/> <address type='drive' controller='1' bus='0' unit='0'/> </disk> <disk type='block' device='disk'> <driver name='qemu' type='raw' cache='none'/> <source dev='/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0180957'/> <target dev='sdg' bus='scsi'/> <alias name='scsi0-0-6'/> <address type='drive' controller='0' bus='0' unit='6'/> </disk> <disk type='block' device='cdrom'> <driver name='qemu' type='raw'/> <target dev='hdc' bus='ide'/> <readonly/> <alias name='ide0-1-0'/> <address type='drive' controller='0' bus='1' unit='0'/> </disk> <controller type='ide' index='0'> <alias name='ide0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x01' function='0x1'/> </controller> <controller type='scsi' index='0'> <alias name='scsi0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x05' function='0x0'/> </controller> <controller type='scsi' index='1'> <alias name='scsi1'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x06' function='0x0'/> </controller> <interface type='bridge'> <mac address='52:54:00:56:3f:67'/> <source bridge='internal'/> <target dev='vnet3'/> <model type='ne2k_pci'/> <alias name='net0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x03' function='0x0'/> </interface> <serial type='pty'> <source path='/dev/pts/3'/> <target port='0'/> <alias name='serial0'/> </serial> <console type='pty' tty='/dev/pts/3'> <source path='/dev/pts/3'/> <target type='serial' port='0'/> <alias name='serial0'/> </console> <input type='mouse' bus='ps2'/> <graphics type='vnc' port='5901' autoport='yes' listen='127.0.0.1'/> <video> <model type='cirrus' vram='9216' heads='1'/> <alias name='video0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x02' function='0x0'/> </video> <memballoon model='virtio'> <alias name='balloon0'/> <address type='pci' domain='0x0000' bus='0x00' slot='0x04' function='0x0'/> </memballoon> </devices> </domain>

.. vilket översätts till dessa kvm-parametrarna:

/usr/bin/kvm -S -M pc-0.12 -enable-kvm -m 5120 -smp 1,sockets=1,cores=1,threads=1 -name fileserver -uuid 0942d0bc-5b60-cc88-b5e7-a03898029527 -nodefaults -chardev socket,id=monitor,path=/var/lib/libvirt/qemu/fileserver.monitor,server,nowait -mon chardev=monitor,mode=readline -rtc base=utc -boot c -device lsi,id=scsi0,bus=pci.0,addr=0x5 -device lsi,id=scsi1,bus=pci.0,addr=0x6 -drive file=/kvm/images/fileserver.img,if=none,id=drive-ide0-0-0,boot=on,format=raw,cache=none -device ide-drive,bus=ide.0,unit=0,drive=drive-ide0-0-0,id=ide0-0-0 -drive file=/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGR1ZYS,if=none,id=drive-scsi0-0-2,format=raw,cache=none -device scsi-disk,bus=scsi0.0,scsi-id=2,drive=drive-scsi0-0-2,id=scsi0-0-2 -drive file=/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGR21HS,if=none,id=drive-scsi0-0-3,format=raw,cache=none -device scsi-disk,bus=scsi0.0,scsi-id=3,drive=drive-scsi0-0-3,id=scsi0-0-3 -drive file=/dev/disk/by-id/ata-Hitachi_HDS722020ALA330_JK1171YAGRXSDS,if=none,id=drive-scsi0-0-4,format=raw,cache=none -device scsi-disk,bus=scsi0.0,scsi-id=4,drive=drive-scsi0-0-4,id=scsi0-0-4 -drive file=/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0265752,if=none,id=drive-scsi0-0-5,format=raw,cache=none -device scsi-disk,bus=scsi0.0,scsi-id=5,drive=drive-scsi0-0-5,id=scsi0-0-5 -drive file=/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0159792,if=none,id=drive-scsi1-0-0,format=raw,cache=none -device scsi-disk,bus=scsi1.0,scsi-id=0,drive=drive-scsi1-0-0,id=scsi1-0-0 -drive file=/dev/disk/by-id/ata-WDC_WD30EZRS-00J99B0_WD-WCAWZ0180957,if=none,id=drive-scsi0-0-6,format=raw,cache=none -device scsi-disk,bus=scsi0.0,scsi-id=6,drive=drive-scsi0-0-6,id=scsi0-0-6 -drive if=none,media=cdrom,id=drive-ide0-1-0,readonly=on,format=raw -device ide-drive,bus=ide.1,unit=0,drive=drive-ide0-1-0,id=ide0-1-0 -device ne2k_pci,vlan=0,id=net0,mac=52:54:00:56:3f:67,bus=pci.0,addr=0x3 -net tap,fd=42,vlan=0,name=hostnet0 -chardev pty,id=serial0 -device isa-serial,chardev=serial0 -usb -vnc 127.0.0.1:1 -vga cirrus -device virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x4

zpool:en är definierad såhär:

pool: tank state: ONLINE scrub: none requested config: NAME STATE READ WRITE CKSUM tank ONLINE 0 0 0 raidz1 ONLINE 0 0 0 label/wd3tbdisk1 ONLINE 0 0 0 label/wd3tbdisk2 ONLINE 0 0 0 label/wd3tbdisk3 ONLINE 0 0 0 raidz1 ONLINE 0 0 0 label/hit2tbdisk1 ONLINE 0 0 0 label/hit2tbdisk2 ONLINE 0 0 0 label/hit2tbdisk3 ONLINE 0 0 0 errors: No known data errors

och båda vdev:arna har 4096b-sektorer (tvingade med gnop):

# zdb|grep ashift ashift=12 ashift=12

Någon som har en idé vad det kan bero på eller vad jag kan testa? Att enbart läsa eller enbart skriva går hur bra som helst.

Kan i princip ingenting om FreeBSD, så jag har inte testat någonting direkt.