Distribute PCI Requests Across Multiple Devices

PCI requests are provisioned in a list based fashion. In SR-IOV networking devices, a set of candidate virtual functions can span multiple physical functions and physical ports. Distributing single and multiple device requests across multiple physical functions provides: * Better load distribution across available links. * Provides L2 redundancy when multiple devices are allocated on behalf of a single guest. blueprint distribute-pci-allocation Change-Id: I4af2a35de9ece709b774df9c05da33fef47157db
2014-12-16 09:22:23 -03:30 · 2014-12-16 09:22:23 -03:30 · 49e21dbad9
commit 49e21dbad9
parent b271b2b417
1 changed files with 147 additions and 0 deletions
--- a/specs/mitaka/approved/distribute-pci-allocations.rst
+++ b/specs/mitaka/approved/distribute-pci-allocations.rst
@ -0,0 +1,147 @@
+..
+ This work is licensed under a Creative Commons Attribution 3.0 Unported
+ License.
+
+ http://creativecommons.org/licenses/by/3.0/legalcode
+
+===============================================
+Distribute PCI Requests Across Multiple Devices
+===============================================
+
+https://blueprints.launchpad.net/nova/+spec/distribute-pci-allocation
+
+PCI requests are provisioned in a list based fashion. In SR-IOV networking
+devices, a set of candidate virtual functions can span multiple physical
+functions and physical ports. Distributing single and multiple device requests
+across multiple physical functions provides:
+
+* Better load distribution across available links.
+
+* Provides L2 redundancy when multiple devices are allocated on behalf of a
+  single guest.
+
+
+Problem description
+===================
+
+Simple queue based device selection for PCI SR-IOV devices does not distribute
+load across physical connections nor does it permit guests to achieve L2
+redundancy by requesting multiple SR-IOV based ports.
+
+Use Cases
+----------
+
+On systems where there is a physical function per physical network and creating
+an Openstack instance with multiple SR-IOV ports, spreading the port
+allocation across candidate physical functions provides more even device and
+link utilization as well as allowing guests to take advantage of L2 redundant
+links for bonding, etc.
+
+Proposed change
+===============
+
+The proposed change alters the PCI device request scheduling for PCI SR-IOV
+to to distribute consumption of VFs evenly across available multiple physical
+functions for the same associated label.
+
+Distribution happens in best-effort fashion, so even if PCI request cannot
+be supported in a distributed fashion, but still could be satisfied by
+the current queue based allocation schema, a guest will be booted.
+
+Alternatives
+------------
+
+There are no alternatives that directly satisfy distribution of requests for
+more even utilization. Ensuring multiple SR-IOV device requests for a guest
+span multiple physical links could employ additional port detail information
+but would still require PCI request scheduling changes.
+
+Link optimization could be improved by providing scheduling based on available
+throughput, However, this does not satisfy the physical link redundancy use
+case. Also, it would only work where throughput details are actually provided
+by the underlying device.
+
+Data model impact
+-----------------
+
+The current model should be sufficient as it contains the required
+bookkeeping.
+
+REST API impact
+---------------
+
+None.
+
+Security impact
+---------------
+
+None.
+
+Notifications impact
+--------------------
+
+None.
+
+Other end user impact
+---------------------
+
+None.
+
+Performance Impact
+------------------
+
+There is an increase in complexity in scheduling guest creation that may
+increase the amount of time taken to schedule guests that are connected to
+multiple PCI similar SR-IOV devices. It has no effect on other cases.
+
+Other deployer impact
+---------------------
+
+None.
+
+Developer impact
+----------------
+
+None.
+
+Implementation
+==============
+
+Assignee(s)
+-----------
+
+Assignee:
+ Roman Bogorodskiy <rbogorodskiy@mirantis.com>
+
+
+Work Items
+----------
+
+* Alter PCI device selection logic to support distribution across multiple PCI
+  roots (physical functions)
+* Extend PCI request API to support multiple device requests.
+* Modify scheduler code to employ the new API.
+
+Dependencies
+============
+
+None.
+
+Testing
+=======
+
+The changes to the selection logic are testable through unit testing, as is
+the extension to the PCI request API.
+
+Additionally, it's intended to cover this feature with functional tests
+as well.
+
+Documentation Impact
+====================
+
+None.
+
+References
+==========
+
+None.