Barrelfish inside qemu with PCIe passthrough

Recent Intel/AMD processors and chipsets support PCIe virtualization. This requires a CPU supporting VT-d/AMD-Vi and a chipset that includes an IOMMU. Linux with KVM support passing a PCIe device to qemu guests transparently. This guide focuses on Intel hardware.

Prerequisites

A computer supporting IOMMU and a processor supporting VT-d are required for this to function. Additionally, a Linux version with KVM and IOMMU support has to be running. Check for IOMMU support using dmesg | grep IOMMU . The output should be similar to the following:

ubuntu@ubuntu:~$ dmesg|grep IOMMU
[    0.000000] Intel-IOMMU: enabled
[    0.000000] Intel-IOMMU: enabled
[    0.401164] dmar: IOMMU 0: reg_base_addr fbffe000 ver 1:0 cap d2078c106f0466 ecap f020de
[    0.416382] dmar: IOMMU 1: reg_base_addr ebffc000 ver 1:0 cap d2078c106f0466 ecap f020de
[    0.436580] IOAPIC id 2 under DRHD base  0xfbffe000 IOMMU 0
[    0.442900] IOAPIC id 0 under DRHD base  0xebffc000 IOMMU 1
[    0.449217] IOAPIC id 1 under DRHD base  0xebffc000 IOMMU 1
[   30.844389] IOMMU 0 0xfbffe000: using Queued invalidation
[   30.850527] IOMMU 1 0xebffc000: using Queued invalidation
[   30.856664] IOMMU: Setting RMRR:
[   30.860391] IOMMU: Setting identity map for device 0000:00:1a.0 [0xbdcf9000 - 0xbdd1dfff]
[   30.869716] IOMMU: Setting identity map for device 0000:00:1d.0 [0xbdcf9000 - 0xbdd1dfff]
[   30.879012] IOMMU: Prepare 0-16MiB unity mapping for LPC
[   30.885068] IOMMU: Setting identity map for device 0000:00:1f.0 [0x0 - 0xffffff]

The IOMMU sometimes needs to be enabled explicitly from whithin the BIOS.

Preparations

This section assumes a Debian/Ubuntu host.

Setup kernel command line

Add intel_iommu=on vfio_iommu_type1.allow_unsafe_interrupts=1 intel_iommu=on to the kernel command line. Maybe one also needs to add pci-stub.ids=VVVV:DDDD to the kernel parameter list with the correct vendor/device ID.

Bind device to vfio-pci

Install qemu-kvm. Create a script with the following contents. Here, we call it pci-setup.sh.

modprobe vfio-pci

for dev in "$@"; do
        vendor=$(cat /sys/bus/pci/devices/$dev/vendor)
        device=$(cat /sys/bus/pci/devices/$dev/device)
        if [ -e /sys/bus/pci/devices/$dev/driver ]; then
                echo $dev > /sys/bus/pci/devices/$dev/driver/unbind
        fi
        echo $vendor $device > /sys/bus/pci/drivers/vfio-pci/new_id
done

Determining device IDs

Use lspci -Dvvnn | grep $identifier to determine PCI bus/device/function numbers. The following examples shows the desired information for a Mellanox card:

ubuntu@ubuntu:~$ lspci -Dvvnn | grep Mellanox
0000:83:00.0 Network controller [0280]: Mellanox Technologies MT27500 Family [ConnectX-3] [15b3:1003]
        Subsystem: Mellanox Technologies Device [15b3:0050]

The device ID is 0000:83:00.0 in this case. Run the above script with the device ID as parameter. For example:

root@ubuntu:~# chmod u+x pci-setup.sh 
root@ubuntu:~# ./pci-setup.sh 0000:83:00.0

Passing the device to qemu

To run qemu with PCI forwarding enabled, add the following command line options: -enable-kvm -device vfio-pci,host=83:00.0, in this case we want to forward the device 83:00.0 to qemu.

Notes

One has to use vfio-pci. The old approach with pci-stub does not work.

Resources

BarrelfishWiki: qemu_iommu (last edited 2015-04-28 11:32:20 by MoritzHoffmann)