Pull networking updates from Jakub Kicinski:
"Core & protocols:
- Wrap datapath globals into net_aligned_data, to avoid false sharing
- Preserve MSG_ZEROCOPY in forwarding (e.g. out of a container)
- Add SO_INQ and SCM_INQ support to AF_UNIX
- Add SIOCINQ support to AF_VSOCK
- Add TCP_MAXSEG sockopt to MPTCP
- Add IPv6 force_forwarding sysctl to enable forwarding per interface
- Make TCP validation of whether packet fully fits in the receive
window and the rcv_buf more strict. With increased use of HW
aggregation a single "packet" can be multiple 100s of kB
- Add MSG_MORE flag to optimize large TCP transmissions via sockmap,
improves latency up to 33% for sockmap users
- Convert TCP send queue handling from tasklet to BH workque
- Improve BPF iteration over TCP sockets to see each socket exactly
once
- Remove obsolete and unused TCP RFC3517/RFC6675 loss recovery code
- Support enabling kernel threads for NAPI processing on per-NAPI
instance basis rather than a whole device. Fully stop the kernel
NAPI thread when threaded NAPI gets disabled. Previously thread
would stick around until ifdown due to tricky synchronization
- Allow multicast routing to take effect on locally-generated packets
- Add output interface argument for End.X in segment routing
- MCTP: add support for gateway routing, improve bind() handling
- Don't require rtnl_lock when fetching an IPv6 neighbor over Netlink
- Add a new neighbor flag ("extern_valid"), which cedes refresh
responsibilities to userspace. This is needed for EVPN multi-homing
where a neighbor entry for a multi-homed host needs to be synced
across all the VTEPs among which the host is multi-homed
- Support NUD_PERMANENT for proxy neighbor entries
- Add a new queuing discipline for IETF RFC9332 DualQ Coupled AQM
- Add sequence numbers to netconsole messages. Unregister
netconsole's console when all net targets are removed. Code
refactoring. Add a number of selftests
- Align IPSec inbound SA lookup to RFC 4301. Only SPI and protocol
should be used for an inbound SA lookup
- Support inspecting ref_tracker state via DebugFS
- Don't force bonding advertisement frames tx to ~333 ms boundaries.
Add broadcast_neighbor option to send ARP/ND on all bonded links
- Allow providing upcall pid for the 'execute' command in openvswitch
- Remove DCCP support from Netfilter's conntrack
- Disallow multiple packet duplications in the queuing layer
- Prevent use of deprecated iptables code on PREEMPT_RT
Driver API:
- Support RSS and hashing configuration over ethtool Netlink
- Add dedicated ethtool callbacks for getting and setting hashing
fields
- Add support for power budget evaluation strategy in PSE /
Power-over-Ethernet. Generate Netlink events for overcurrent etc
- Support DPLL phase offset monitoring across all device inputs.
Support providing clock reference and SYNC over separate DPLL
inputs
- Support traffic classes in devlink rate API for bandwidth
management
- Remove rtnl_lock dependency from UDP tunnel port configuration
Device drivers:
- Add a new Broadcom driver for 800G Ethernet (bnge)
- Add a standalone driver for Microchip ZL3073x DPLL
- Remove IBM's NETIUCV device driver
- Ethernet high-speed NICs:
- Broadcom (bnxt):
- support zero-copy Tx of DMABUF memory
- take page size into account for page pool recycling rings
- Intel (100G, ice, idpf):
- idpf: XDP and AF_XDP support preparations
- idpf: add flow steering
- add link_down_events statistic
- clean up the TSPLL code
- preparations for live VM migration
- nVidia/Mellanox:
- support zero-copy Rx/Tx interfaces (DMABUF and io_uring)
- optimize context memory usage for matchers
- expose serial numbers in devlink info
- support PCIe congestion metrics
- Meta (fbnic):
- add 25G, 50G, and 100G link modes to phylink
- support dumping FW logs
- Marvell/Cavium:
- support for CN20K generation of the Octeon chips
- Amazon:
- add HW clock (without timestamping, just hypervisor time access)
- Ethernet virtual:
- VirtIO net:
- support segmentation of UDP-tunnel-encapsulated packets
- Google (gve):
- support packet timestamping and clock synchronization
- Microsoft vNIC:
- add handler for device-originated servicing events
- allow dynamic MSI-X vector allocation
- support Tx bandwidth clamping
- Ethernet NICs consumer, and embedded:
- AMD:
- amd-xgbe: hardware timestamping and PTP clock support
- Broadcom integrated MACs (bcmgenet, bcmasp):
- use napi_complete_done() return value to support NAPI polling
- add support for re-starting auto-negotiation
- Broadcom switches (b53):
- support BCM5325 switches
- add bcm63xx EPHY power control
- Synopsys (stmmac):
- lots of code refactoring and cleanups
- TI:
- icssg-prueth: read firmware-names from device tree
- icssg: PRP offload support
- Microchip:
- lan78xx: convert to PHYLINK for improved PHY and MAC management
- ksz: add KSZ8463 switch support
- Intel:
- support similar queue priority scheme in multi-queue and
time-sensitive networking (taprio)
- support packet pre-emption in both
- RealTek (r8169):
- enable EEE at 5Gbps on RTL8126
- Airoha:
- add PPPoE offload support
- MDIO bus controller for Airoha AN7583
- Ethernet PHYs:
- support for the IPQ5018 internal GE PHY
- micrel KSZ9477 switch-integrated PHYs:
- add MDI/MDI-X control support
- add RX error counters
- add cable test support
- add Signal Quality Indicator (SQI) reporting
- dp83tg720: improve reset handling and reduce link recovery time
- support bcm54811 (and its MII-Lite interface type)
- air_en8811h: support resume/suspend
- support PHY counters for QCA807x and QCA808x
- support WoL for QCA807x
- CAN drivers:
- rcar_canfd: support for Transceiver Delay Compensation
- kvaser: report FW versions via devlink dev info
- WiFi:
- extended regulatory info support (6 GHz)
- add statistics and beacon monitor for Multi-Link Operation (MLO)
- support S1G aggregation, improve S1G support
- add Radio Measurement action fields
- support per-radio RTS threshold
- some work around how FIPS affects wifi, which was wrong (RC4 is
used by TKIP, not only WEP)
- improvements for unsolicited probe response handling
- WiFi drivers:
- RealTek (rtw88):
- IBSS mode for SDIO devices
- RealTek (rtw89):
- BT coexistence for MLO/WiFi7
- concurrent station + P2P support
- support for USB devices RTL8851BU/RTL8852BU
- Intel (iwlwifi):
- use embedded PNVM in (to be released) FW images to fix
compatibility issues
- many cleanups (unused FW APIs, PCIe code, WoWLAN)
- some FIPS interoperability
- MediaTek (mt76):
- firmware recovery improvements
- more MLO work
- Qualcomm/Atheros (ath12k):
- fix scan on multi-radio devices
- more EHT/Wi-Fi 7 features
- encapsulation/decapsulation offload
- Broadcom (brcm80211):
- support SDIO 43751 device
- Bluetooth:
- hci_event: add support for handling LE BIG Sync Lost event
- ISO: add socket option to report packet seqnum via CMSG
- ISO: support SCM_TIMESTAMPING for ISO TS
- Bluetooth drivers:
- intel_pcie: support Function Level Reset
- nxpuart: add support for 4M baudrate
- nxpuart: implement powerup sequence, reset, FW dump, and FW loading"
* tag 'net-next-6.17' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net-next: (1742 commits)
dpll: zl3073x: Fix build failure
selftests: bpf: fix legacy netfilter options
ipv6: annotate data-races around rt->fib6_nsiblings
ipv6: fix possible infinite loop in fib6_info_uses_dev()
ipv6: prevent infinite loop in rt6_nlmsg_size()
ipv6: add a retry logic in net6_rt_notify()
vrf: Drop existing dst reference in vrf_ip6_input_dst
net/sched: taprio: align entry index attr validation with mqprio
net: fsl_pq_mdio: use dev_err_probe
selftests: rtnetlink.sh: remove esp4_offload after test
vsock: remove unnecessary null check in vsock_getname()
igb: xsk: solve negative overflow of nb_pkts in zerocopy mode
stmmac: xsk: fix negative overflow of budget in zerocopy mode
dt-bindings: ieee802154: Convert at86rf230.txt yaml format
net: dsa: microchip: Disable PTP function of KSZ8463
net: dsa: microchip: Setup fiber ports for KSZ8463
net: dsa: microchip: Write switch MAC address differently for KSZ8463
net: dsa: microchip: Use different registers for KSZ8463
net: dsa: microchip: Add KSZ8463 switch support to KSZ DSA driver
dt-bindings: net: dsa: microchip: Add KSZ8463 switch support
...
722 lines
24 KiB
C
722 lines
24 KiB
C
/* SPDX-License-Identifier: GPL-2.0 */
|
|
#ifndef LINUX_MSI_H
|
|
#define LINUX_MSI_H
|
|
|
|
/*
|
|
* This header file contains MSI data structures and functions which are
|
|
* only relevant for:
|
|
* - Interrupt core code
|
|
* - PCI/MSI core code
|
|
* - MSI interrupt domain implementations
|
|
* - IOMMU, low level VFIO, NTB and other justified exceptions
|
|
* dealing with low level MSI details.
|
|
*
|
|
* Regular device drivers have no business with any of these functions and
|
|
* especially storing MSI descriptor pointers in random code is considered
|
|
* abuse.
|
|
*
|
|
* Device driver relevant functions are available in <linux/msi_api.h>
|
|
*/
|
|
|
|
#include <linux/irqdomain_defs.h>
|
|
#include <linux/cpumask_types.h>
|
|
#include <linux/msi_api.h>
|
|
#include <linux/irq.h>
|
|
|
|
#include <asm/msi.h>
|
|
|
|
/* Dummy shadow structures if an architecture does not define them */
|
|
#ifndef arch_msi_msg_addr_lo
|
|
typedef struct arch_msi_msg_addr_lo {
|
|
u32 address_lo;
|
|
} __attribute__ ((packed)) arch_msi_msg_addr_lo_t;
|
|
#endif
|
|
|
|
#ifndef arch_msi_msg_addr_hi
|
|
typedef struct arch_msi_msg_addr_hi {
|
|
u32 address_hi;
|
|
} __attribute__ ((packed)) arch_msi_msg_addr_hi_t;
|
|
#endif
|
|
|
|
#ifndef arch_msi_msg_data
|
|
typedef struct arch_msi_msg_data {
|
|
u32 data;
|
|
} __attribute__ ((packed)) arch_msi_msg_data_t;
|
|
#endif
|
|
|
|
#ifndef arch_is_isolated_msi
|
|
#define arch_is_isolated_msi() false
|
|
#endif
|
|
|
|
/**
|
|
* msi_msg - Representation of a MSI message
|
|
* @address_lo: Low 32 bits of msi message address
|
|
* @arch_addrlo: Architecture specific shadow of @address_lo
|
|
* @address_hi: High 32 bits of msi message address
|
|
* (only used when device supports it)
|
|
* @arch_addrhi: Architecture specific shadow of @address_hi
|
|
* @data: MSI message data (usually 16 bits)
|
|
* @arch_data: Architecture specific shadow of @data
|
|
*/
|
|
struct msi_msg {
|
|
union {
|
|
u32 address_lo;
|
|
arch_msi_msg_addr_lo_t arch_addr_lo;
|
|
};
|
|
union {
|
|
u32 address_hi;
|
|
arch_msi_msg_addr_hi_t arch_addr_hi;
|
|
};
|
|
union {
|
|
u32 data;
|
|
arch_msi_msg_data_t arch_data;
|
|
};
|
|
};
|
|
|
|
/* Helper functions */
|
|
struct msi_desc;
|
|
struct pci_dev;
|
|
struct device_attribute;
|
|
struct irq_domain;
|
|
struct irq_affinity_desc;
|
|
|
|
void __get_cached_msi_msg(struct msi_desc *entry, struct msi_msg *msg);
|
|
#ifdef CONFIG_GENERIC_MSI_IRQ
|
|
void get_cached_msi_msg(unsigned int irq, struct msi_msg *msg);
|
|
#else
|
|
static inline void get_cached_msi_msg(unsigned int irq, struct msi_msg *msg) { }
|
|
#endif
|
|
|
|
typedef void (*irq_write_msi_msg_t)(struct msi_desc *desc,
|
|
struct msi_msg *msg);
|
|
|
|
/**
|
|
* pci_msi_desc - PCI/MSI specific MSI descriptor data
|
|
*
|
|
* @msi_mask: [PCI MSI] MSI cached mask bits
|
|
* @msix_ctrl: [PCI MSI-X] MSI-X cached per vector control bits
|
|
* @is_msix: [PCI MSI/X] True if MSI-X
|
|
* @multiple: [PCI MSI/X] log2 num of messages allocated
|
|
* @multi_cap: [PCI MSI/X] log2 num of messages supported
|
|
* @can_mask: [PCI MSI/X] Masking supported?
|
|
* @is_64: [PCI MSI/X] Address size: 0=32bit 1=64bit
|
|
* @default_irq:[PCI MSI/X] The default pre-assigned non-MSI irq
|
|
* @mask_pos: [PCI MSI] Mask register position
|
|
* @mask_base: [PCI MSI-X] Mask register base address
|
|
*/
|
|
struct pci_msi_desc {
|
|
union {
|
|
u32 msi_mask;
|
|
u32 msix_ctrl;
|
|
};
|
|
struct {
|
|
u8 is_msix : 1;
|
|
u8 multiple : 3;
|
|
u8 multi_cap : 3;
|
|
u8 can_mask : 1;
|
|
u8 is_64 : 1;
|
|
u8 is_virtual : 1;
|
|
unsigned default_irq;
|
|
} msi_attrib;
|
|
union {
|
|
u8 mask_pos;
|
|
void __iomem *mask_base;
|
|
};
|
|
};
|
|
|
|
/**
|
|
* union msi_domain_cookie - Opaque MSI domain specific data
|
|
* @value: u64 value store
|
|
* @ptr: Pointer to domain specific data
|
|
* @iobase: Domain specific IOmem pointer
|
|
*
|
|
* The content of this data is implementation defined and used by the MSI
|
|
* domain to store domain specific information which is requried for
|
|
* interrupt chip callbacks.
|
|
*/
|
|
union msi_domain_cookie {
|
|
u64 value;
|
|
void *ptr;
|
|
void __iomem *iobase;
|
|
};
|
|
|
|
/**
|
|
* struct msi_desc_data - Generic MSI descriptor data
|
|
* @dcookie: Cookie for MSI domain specific data which is required
|
|
* for irq_chip callbacks
|
|
* @icookie: Cookie for the MSI interrupt instance provided by
|
|
* the usage site to the allocation function
|
|
*
|
|
* The content of this data is implementation defined, e.g. PCI/IMS
|
|
* implementations define the meaning of the data. The MSI core ignores
|
|
* this data completely.
|
|
*/
|
|
struct msi_desc_data {
|
|
union msi_domain_cookie dcookie;
|
|
union msi_instance_cookie icookie;
|
|
};
|
|
|
|
#define MSI_MAX_INDEX ((unsigned int)USHRT_MAX)
|
|
|
|
/**
|
|
* struct msi_desc - Descriptor structure for MSI based interrupts
|
|
* @irq: The base interrupt number
|
|
* @nvec_used: The number of vectors used
|
|
* @dev: Pointer to the device which uses this descriptor
|
|
* @msg: The last set MSI message cached for reuse
|
|
* @affinity: Optional pointer to a cpu affinity mask for this descriptor
|
|
* @iommu_msi_iova: Optional shifted IOVA from the IOMMU to override the msi_addr.
|
|
* Only used if iommu_msi_shift != 0
|
|
* @iommu_msi_shift: Indicates how many bits of the original address should be
|
|
* preserved when using iommu_msi_iova.
|
|
* @sysfs_attr: Pointer to sysfs device attribute
|
|
*
|
|
* @write_msi_msg: Callback that may be called when the MSI message
|
|
* address or data changes
|
|
* @write_msi_msg_data: Data parameter for the callback.
|
|
*
|
|
* @msi_index: Index of the msi descriptor
|
|
* @pci: PCI specific msi descriptor data
|
|
* @data: Generic MSI descriptor data
|
|
*/
|
|
struct msi_desc {
|
|
/* Shared device/bus type independent data */
|
|
unsigned int irq;
|
|
unsigned int nvec_used;
|
|
struct device *dev;
|
|
struct msi_msg msg;
|
|
struct irq_affinity_desc *affinity;
|
|
#ifdef CONFIG_IRQ_MSI_IOMMU
|
|
u64 iommu_msi_iova : 58;
|
|
u64 iommu_msi_shift : 6;
|
|
#endif
|
|
#ifdef CONFIG_SYSFS
|
|
struct device_attribute *sysfs_attrs;
|
|
#endif
|
|
|
|
void (*write_msi_msg)(struct msi_desc *entry, void *data);
|
|
void *write_msi_msg_data;
|
|
|
|
u16 msi_index;
|
|
union {
|
|
struct pci_msi_desc pci;
|
|
struct msi_desc_data data;
|
|
};
|
|
};
|
|
|
|
/*
|
|
* Filter values for the MSI descriptor iterators and accessor functions.
|
|
*/
|
|
enum msi_desc_filter {
|
|
/* All descriptors */
|
|
MSI_DESC_ALL,
|
|
/* Descriptors which have no interrupt associated */
|
|
MSI_DESC_NOTASSOCIATED,
|
|
/* Descriptors which have an interrupt associated */
|
|
MSI_DESC_ASSOCIATED,
|
|
};
|
|
|
|
|
|
/**
|
|
* struct msi_dev_domain - The internals of MSI domain info per device
|
|
* @store: Xarray for storing MSI descriptor pointers
|
|
* @irqdomain: Pointer to a per device interrupt domain
|
|
*/
|
|
struct msi_dev_domain {
|
|
struct xarray store;
|
|
struct irq_domain *domain;
|
|
};
|
|
|
|
int msi_setup_device_data(struct device *dev);
|
|
|
|
void __msi_lock_descs(struct device *dev);
|
|
void __msi_unlock_descs(struct device *dev);
|
|
|
|
DEFINE_LOCK_GUARD_1(msi_descs_lock, struct device, __msi_lock_descs(_T->lock),
|
|
__msi_unlock_descs(_T->lock));
|
|
|
|
struct msi_desc *msi_domain_first_desc(struct device *dev, unsigned int domid,
|
|
enum msi_desc_filter filter);
|
|
|
|
/**
|
|
* msi_first_desc - Get the first MSI descriptor of the default irqdomain
|
|
* @dev: Device to operate on
|
|
* @filter: Descriptor state filter
|
|
*
|
|
* Must be called with the MSI descriptor mutex held, i.e. msi_lock_descs()
|
|
* must be invoked before the call.
|
|
*
|
|
* Return: Pointer to the first MSI descriptor matching the search
|
|
* criteria, NULL if none found.
|
|
*/
|
|
static inline struct msi_desc *msi_first_desc(struct device *dev,
|
|
enum msi_desc_filter filter)
|
|
{
|
|
return msi_domain_first_desc(dev, MSI_DEFAULT_DOMAIN, filter);
|
|
}
|
|
|
|
struct msi_desc *msi_next_desc(struct device *dev, unsigned int domid,
|
|
enum msi_desc_filter filter);
|
|
|
|
/**
|
|
* msi_domain_for_each_desc - Iterate the MSI descriptors in a specific domain
|
|
*
|
|
* @desc: struct msi_desc pointer used as iterator
|
|
* @dev: struct device pointer - device to iterate
|
|
* @domid: The id of the interrupt domain which should be walked.
|
|
* @filter: Filter for descriptor selection
|
|
*
|
|
* Notes:
|
|
* - The loop must be protected with a msi_lock_descs()/msi_unlock_descs()
|
|
* pair.
|
|
* - It is safe to remove a retrieved MSI descriptor in the loop.
|
|
*/
|
|
#define msi_domain_for_each_desc(desc, dev, domid, filter) \
|
|
for ((desc) = msi_domain_first_desc((dev), (domid), (filter)); (desc); \
|
|
(desc) = msi_next_desc((dev), (domid), (filter)))
|
|
|
|
/**
|
|
* msi_for_each_desc - Iterate the MSI descriptors in the default irqdomain
|
|
*
|
|
* @desc: struct msi_desc pointer used as iterator
|
|
* @dev: struct device pointer - device to iterate
|
|
* @filter: Filter for descriptor selection
|
|
*
|
|
* Notes:
|
|
* - The loop must be protected with a msi_lock_descs()/msi_unlock_descs()
|
|
* pair.
|
|
* - It is safe to remove a retrieved MSI descriptor in the loop.
|
|
*/
|
|
#define msi_for_each_desc(desc, dev, filter) \
|
|
msi_domain_for_each_desc((desc), (dev), MSI_DEFAULT_DOMAIN, (filter))
|
|
|
|
#define msi_desc_to_dev(desc) ((desc)->dev)
|
|
|
|
static inline void msi_desc_set_iommu_msi_iova(struct msi_desc *desc, u64 msi_iova,
|
|
unsigned int msi_shift)
|
|
{
|
|
#ifdef CONFIG_IRQ_MSI_IOMMU
|
|
desc->iommu_msi_iova = msi_iova >> msi_shift;
|
|
desc->iommu_msi_shift = msi_shift;
|
|
#endif
|
|
}
|
|
|
|
/**
|
|
* msi_msg_set_addr() - Set MSI address in an MSI message
|
|
*
|
|
* @desc: MSI descriptor that may carry an IOVA base address for MSI via @iommu_msi_iova/shift
|
|
* @msg: Target MSI message to set its address_hi and address_lo
|
|
* @msi_addr: Physical address to set the MSI message
|
|
*
|
|
* Notes:
|
|
* - Override @msi_addr using the IOVA base address in the @desc if @iommu_msi_shift is set
|
|
* - Otherwise, simply set @msi_addr to @msg
|
|
*/
|
|
static inline void msi_msg_set_addr(struct msi_desc *desc, struct msi_msg *msg,
|
|
phys_addr_t msi_addr)
|
|
{
|
|
#ifdef CONFIG_IRQ_MSI_IOMMU
|
|
if (desc->iommu_msi_shift) {
|
|
u64 msi_iova = desc->iommu_msi_iova << desc->iommu_msi_shift;
|
|
|
|
msg->address_hi = upper_32_bits(msi_iova);
|
|
msg->address_lo = lower_32_bits(msi_iova) |
|
|
(msi_addr & ((1 << desc->iommu_msi_shift) - 1));
|
|
return;
|
|
}
|
|
#endif
|
|
msg->address_hi = upper_32_bits(msi_addr);
|
|
msg->address_lo = lower_32_bits(msi_addr);
|
|
}
|
|
|
|
int msi_domain_insert_msi_desc(struct device *dev, unsigned int domid,
|
|
struct msi_desc *init_desc);
|
|
/**
|
|
* msi_insert_msi_desc - Allocate and initialize a MSI descriptor in the
|
|
* default irqdomain and insert it at @init_desc->msi_index
|
|
* @dev: Pointer to the device for which the descriptor is allocated
|
|
* @init_desc: Pointer to an MSI descriptor to initialize the new descriptor
|
|
*
|
|
* Return: 0 on success or an appropriate failure code.
|
|
*/
|
|
static inline int msi_insert_msi_desc(struct device *dev, struct msi_desc *init_desc)
|
|
{
|
|
return msi_domain_insert_msi_desc(dev, MSI_DEFAULT_DOMAIN, init_desc);
|
|
}
|
|
|
|
void msi_domain_free_msi_descs_range(struct device *dev, unsigned int domid,
|
|
unsigned int first, unsigned int last);
|
|
|
|
/**
|
|
* msi_free_msi_descs_range - Free a range of MSI descriptors of a device
|
|
* in the default irqdomain
|
|
*
|
|
* @dev: Device for which to free the descriptors
|
|
* @first: Index to start freeing from (inclusive)
|
|
* @last: Last index to be freed (inclusive)
|
|
*/
|
|
static inline void msi_free_msi_descs_range(struct device *dev, unsigned int first,
|
|
unsigned int last)
|
|
{
|
|
msi_domain_free_msi_descs_range(dev, MSI_DEFAULT_DOMAIN, first, last);
|
|
}
|
|
|
|
/**
|
|
* msi_free_msi_descs - Free all MSI descriptors of a device in the default irqdomain
|
|
* @dev: Device to free the descriptors
|
|
*/
|
|
static inline void msi_free_msi_descs(struct device *dev)
|
|
{
|
|
msi_free_msi_descs_range(dev, 0, MSI_MAX_INDEX);
|
|
}
|
|
|
|
/*
|
|
* The arch hooks to setup up msi irqs. Default functions are implemented
|
|
* as weak symbols so that they /can/ be overriden by architecture specific
|
|
* code if needed. These hooks can only be enabled by the architecture.
|
|
*
|
|
* If CONFIG_PCI_MSI_ARCH_FALLBACKS is not selected they are replaced by
|
|
* stubs with warnings.
|
|
*/
|
|
#ifdef CONFIG_PCI_MSI_ARCH_FALLBACKS
|
|
int arch_setup_msi_irq(struct pci_dev *dev, struct msi_desc *desc);
|
|
void arch_teardown_msi_irq(unsigned int irq);
|
|
int arch_setup_msi_irqs(struct pci_dev *dev, int nvec, int type);
|
|
void arch_teardown_msi_irqs(struct pci_dev *dev);
|
|
#endif /* CONFIG_PCI_MSI_ARCH_FALLBACKS */
|
|
|
|
/*
|
|
* Xen uses non-default msi_domain_ops and hence needs a way to populate sysfs
|
|
* entries of MSI IRQs.
|
|
*/
|
|
#if defined(CONFIG_PCI_XEN) || defined(CONFIG_PCI_MSI_ARCH_FALLBACKS)
|
|
#ifdef CONFIG_SYSFS
|
|
int msi_device_populate_sysfs(struct device *dev);
|
|
void msi_device_destroy_sysfs(struct device *dev);
|
|
#else /* CONFIG_SYSFS */
|
|
static inline int msi_device_populate_sysfs(struct device *dev) { return 0; }
|
|
static inline void msi_device_destroy_sysfs(struct device *dev) { }
|
|
#endif /* !CONFIG_SYSFS */
|
|
#endif /* CONFIG_PCI_XEN || CONFIG_PCI_MSI_ARCH_FALLBACKS */
|
|
|
|
/*
|
|
* The restore hook is still available even for fully irq domain based
|
|
* setups. Courtesy to XEN/X86.
|
|
*/
|
|
bool arch_restore_msi_irqs(struct pci_dev *dev);
|
|
|
|
#ifdef CONFIG_GENERIC_MSI_IRQ
|
|
|
|
#include <linux/irqhandler.h>
|
|
|
|
struct irq_domain;
|
|
struct irq_domain_ops;
|
|
struct irq_chip;
|
|
struct irq_fwspec;
|
|
struct device_node;
|
|
struct fwnode_handle;
|
|
struct msi_domain_info;
|
|
|
|
/**
|
|
* struct msi_domain_ops - MSI interrupt domain callbacks
|
|
* @get_hwirq: Retrieve the resulting hw irq number
|
|
* @msi_init: Domain specific init function for MSI interrupts
|
|
* @msi_free: Domain specific function to free a MSI interrupts
|
|
* @msi_prepare: Prepare the allocation of the interrupts in the domain
|
|
* @msi_teardown: Reverse the effects of @msi_prepare
|
|
* @prepare_desc: Optional function to prepare the allocated MSI descriptor
|
|
* in the domain
|
|
* @set_desc: Set the msi descriptor for an interrupt
|
|
* @domain_alloc_irqs: Optional function to override the default allocation
|
|
* function.
|
|
* @domain_free_irqs: Optional function to override the default free
|
|
* function.
|
|
* @msi_post_free: Optional function which is invoked after freeing
|
|
* all interrupts.
|
|
* @msi_translate: Optional translate callback to support the odd wire to
|
|
* MSI bridges, e.g. MBIGEN
|
|
*
|
|
* @get_hwirq, @msi_init and @msi_free are callbacks used by the underlying
|
|
* irqdomain.
|
|
*
|
|
* @msi_check, @msi_prepare, @msi_teardown, @prepare_desc and
|
|
* @set_desc are callbacks used by the msi_domain_alloc/free_irqs*()
|
|
* variants.
|
|
*
|
|
* @domain_alloc_irqs, @domain_free_irqs can be used to override the
|
|
* default allocation/free functions (__msi_domain_alloc/free_irqs). This
|
|
* is initially for a wrapper around XENs seperate MSI universe which can't
|
|
* be wrapped into the regular irq domains concepts by mere mortals. This
|
|
* allows to universally use msi_domain_alloc/free_irqs without having to
|
|
* special case XEN all over the place.
|
|
*/
|
|
struct msi_domain_ops {
|
|
irq_hw_number_t (*get_hwirq)(struct msi_domain_info *info,
|
|
msi_alloc_info_t *arg);
|
|
int (*msi_init)(struct irq_domain *domain,
|
|
struct msi_domain_info *info,
|
|
unsigned int virq, irq_hw_number_t hwirq,
|
|
msi_alloc_info_t *arg);
|
|
void (*msi_free)(struct irq_domain *domain,
|
|
struct msi_domain_info *info,
|
|
unsigned int virq);
|
|
int (*msi_prepare)(struct irq_domain *domain,
|
|
struct device *dev, int nvec,
|
|
msi_alloc_info_t *arg);
|
|
void (*msi_teardown)(struct irq_domain *domain,
|
|
msi_alloc_info_t *arg);
|
|
void (*prepare_desc)(struct irq_domain *domain, msi_alloc_info_t *arg,
|
|
struct msi_desc *desc);
|
|
void (*set_desc)(msi_alloc_info_t *arg,
|
|
struct msi_desc *desc);
|
|
int (*domain_alloc_irqs)(struct irq_domain *domain,
|
|
struct device *dev, int nvec);
|
|
void (*domain_free_irqs)(struct irq_domain *domain,
|
|
struct device *dev);
|
|
void (*msi_post_free)(struct irq_domain *domain,
|
|
struct device *dev);
|
|
int (*msi_translate)(struct irq_domain *domain, struct irq_fwspec *fwspec,
|
|
irq_hw_number_t *hwirq, unsigned int *type);
|
|
};
|
|
|
|
/**
|
|
* struct msi_domain_info - MSI interrupt domain data
|
|
* @flags: Flags to decribe features and capabilities
|
|
* @bus_token: The domain bus token
|
|
* @hwsize: The hardware table size or the software index limit.
|
|
* If 0 then the size is considered unlimited and
|
|
* gets initialized to the maximum software index limit
|
|
* by the domain creation code.
|
|
* @ops: The callback data structure
|
|
* @dev: Device which creates the domain
|
|
* @chip: Optional: associated interrupt chip
|
|
* @chip_data: Optional: associated interrupt chip data
|
|
* @handler: Optional: associated interrupt flow handler
|
|
* @handler_data: Optional: associated interrupt flow handler data
|
|
* @handler_name: Optional: associated interrupt flow handler name
|
|
* @alloc_data: Optional: associated interrupt allocation data
|
|
* @data: Optional: domain specific data
|
|
*/
|
|
struct msi_domain_info {
|
|
u32 flags;
|
|
enum irq_domain_bus_token bus_token;
|
|
unsigned int hwsize;
|
|
struct msi_domain_ops *ops;
|
|
struct device *dev;
|
|
struct irq_chip *chip;
|
|
void *chip_data;
|
|
irq_flow_handler_t handler;
|
|
void *handler_data;
|
|
const char *handler_name;
|
|
msi_alloc_info_t *alloc_data;
|
|
void *data;
|
|
};
|
|
|
|
/**
|
|
* struct msi_domain_template - Template for MSI device domains
|
|
* @name: Storage for the resulting name. Filled in by the core.
|
|
* @chip: Interrupt chip for this domain
|
|
* @ops: MSI domain ops
|
|
* @info: MSI domain info data
|
|
* @alloc_info: MSI domain allocation data (architecture specific)
|
|
*/
|
|
struct msi_domain_template {
|
|
char name[48];
|
|
struct irq_chip chip;
|
|
struct msi_domain_ops ops;
|
|
struct msi_domain_info info;
|
|
msi_alloc_info_t alloc_info;
|
|
};
|
|
|
|
/*
|
|
* Flags for msi_domain_info
|
|
*
|
|
* Bit 0-15: Generic MSI functionality which is not subject to restriction
|
|
* by parent domains
|
|
*
|
|
* Bit 16-31: Functionality which depends on the underlying parent domain and
|
|
* can be masked out by msi_parent_ops::init_dev_msi_info() when
|
|
* a device MSI domain is initialized.
|
|
*/
|
|
enum {
|
|
/*
|
|
* Init non implemented ops callbacks with default MSI domain
|
|
* callbacks.
|
|
*/
|
|
MSI_FLAG_USE_DEF_DOM_OPS = (1 << 0),
|
|
/*
|
|
* Init non implemented chip callbacks with default MSI chip
|
|
* callbacks.
|
|
*/
|
|
MSI_FLAG_USE_DEF_CHIP_OPS = (1 << 1),
|
|
/* Needs early activate, required for PCI */
|
|
MSI_FLAG_ACTIVATE_EARLY = (1 << 2),
|
|
/*
|
|
* Must reactivate when irq is started even when
|
|
* MSI_FLAG_ACTIVATE_EARLY has been set.
|
|
*/
|
|
MSI_FLAG_MUST_REACTIVATE = (1 << 3),
|
|
/* Populate sysfs on alloc() and destroy it on free() */
|
|
MSI_FLAG_DEV_SYSFS = (1 << 4),
|
|
/* Allocate simple MSI descriptors */
|
|
MSI_FLAG_ALLOC_SIMPLE_MSI_DESCS = (1 << 5),
|
|
/* Free MSI descriptors */
|
|
MSI_FLAG_FREE_MSI_DESCS = (1 << 6),
|
|
/* Use dev->fwnode for MSI device domain creation */
|
|
MSI_FLAG_USE_DEV_FWNODE = (1 << 7),
|
|
/* Set parent->dev into domain->pm_dev on device domain creation */
|
|
MSI_FLAG_PARENT_PM_DEV = (1 << 8),
|
|
/* Support for parent mask/unmask */
|
|
MSI_FLAG_PCI_MSI_MASK_PARENT = (1 << 9),
|
|
|
|
/* Mask for the generic functionality */
|
|
MSI_GENERIC_FLAGS_MASK = GENMASK(15, 0),
|
|
|
|
/* Mask for the domain specific functionality */
|
|
MSI_DOMAIN_FLAGS_MASK = GENMASK(31, 16),
|
|
|
|
/* Support multiple PCI MSI interrupts */
|
|
MSI_FLAG_MULTI_PCI_MSI = (1 << 16),
|
|
/* Support PCI MSIX interrupts */
|
|
MSI_FLAG_PCI_MSIX = (1 << 17),
|
|
/* Is level-triggered capable, using two messages */
|
|
MSI_FLAG_LEVEL_CAPABLE = (1 << 18),
|
|
/* MSI-X entries must be contiguous */
|
|
MSI_FLAG_MSIX_CONTIGUOUS = (1 << 19),
|
|
/* PCI/MSI-X vectors can be dynamically allocated/freed post MSI-X enable */
|
|
MSI_FLAG_PCI_MSIX_ALLOC_DYN = (1 << 20),
|
|
/* PCI MSIs cannot be steered separately to CPU cores */
|
|
MSI_FLAG_NO_AFFINITY = (1 << 21),
|
|
/* Inhibit usage of entry masking */
|
|
MSI_FLAG_NO_MASK = (1 << 22),
|
|
};
|
|
|
|
/*
|
|
* Flags for msi_parent_ops::chip_flags
|
|
*/
|
|
enum {
|
|
MSI_CHIP_FLAG_SET_EOI = (1 << 0),
|
|
MSI_CHIP_FLAG_SET_ACK = (1 << 1),
|
|
};
|
|
|
|
/**
|
|
* struct msi_parent_ops - MSI parent domain callbacks and configuration info
|
|
*
|
|
* @supported_flags: Required: The supported MSI flags of the parent domain
|
|
* @required_flags: Optional: The required MSI flags of the parent MSI domain
|
|
* @chip_flags: Optional: Select MSI chip callbacks to update with defaults
|
|
* in msi_lib_init_dev_msi_info().
|
|
* @bus_select_token: Optional: The bus token of the real parent domain for
|
|
* irq_domain::select()
|
|
* @bus_select_mask: Optional: A mask of supported BUS_DOMAINs for
|
|
* irq_domain::select()
|
|
* @prefix: Optional: Prefix for the domain and chip name
|
|
* @init_dev_msi_info: Required: Callback for MSI parent domains to setup parent
|
|
* domain specific domain flags, domain ops and interrupt chip
|
|
* callbacks when a per device domain is created.
|
|
*/
|
|
struct msi_parent_ops {
|
|
u32 supported_flags;
|
|
u32 required_flags;
|
|
u32 chip_flags;
|
|
u32 bus_select_token;
|
|
u32 bus_select_mask;
|
|
const char *prefix;
|
|
bool (*init_dev_msi_info)(struct device *dev, struct irq_domain *domain,
|
|
struct irq_domain *msi_parent_domain,
|
|
struct msi_domain_info *msi_child_info);
|
|
};
|
|
|
|
bool msi_parent_init_dev_msi_info(struct device *dev, struct irq_domain *domain,
|
|
struct irq_domain *msi_parent_domain,
|
|
struct msi_domain_info *msi_child_info);
|
|
|
|
int msi_domain_set_affinity(struct irq_data *data, const struct cpumask *mask,
|
|
bool force);
|
|
|
|
struct irq_domain *msi_create_irq_domain(struct fwnode_handle *fwnode,
|
|
struct msi_domain_info *info,
|
|
struct irq_domain *parent);
|
|
|
|
struct irq_domain_info;
|
|
struct irq_domain *msi_create_parent_irq_domain(struct irq_domain_info *info,
|
|
const struct msi_parent_ops *msi_parent_ops);
|
|
|
|
bool msi_create_device_irq_domain(struct device *dev, unsigned int domid,
|
|
const struct msi_domain_template *template,
|
|
unsigned int hwsize, void *domain_data,
|
|
void *chip_data);
|
|
void msi_remove_device_irq_domain(struct device *dev, unsigned int domid);
|
|
|
|
bool msi_match_device_irq_domain(struct device *dev, unsigned int domid,
|
|
enum irq_domain_bus_token bus_token);
|
|
|
|
int msi_domain_alloc_irqs_range_locked(struct device *dev, unsigned int domid,
|
|
unsigned int first, unsigned int last);
|
|
int msi_domain_alloc_irqs_range(struct device *dev, unsigned int domid,
|
|
unsigned int first, unsigned int last);
|
|
int msi_domain_alloc_irqs_all_locked(struct device *dev, unsigned int domid, int nirqs);
|
|
|
|
struct msi_map msi_domain_alloc_irq_at(struct device *dev, unsigned int domid, unsigned int index,
|
|
const struct irq_affinity_desc *affdesc,
|
|
union msi_instance_cookie *cookie);
|
|
|
|
void msi_domain_free_irqs_range_locked(struct device *dev, unsigned int domid,
|
|
unsigned int first, unsigned int last);
|
|
void msi_domain_free_irqs_range(struct device *dev, unsigned int domid,
|
|
unsigned int first, unsigned int last);
|
|
void msi_domain_free_irqs_all_locked(struct device *dev, unsigned int domid);
|
|
void msi_domain_free_irqs_all(struct device *dev, unsigned int domid);
|
|
|
|
struct msi_domain_info *msi_get_domain_info(struct irq_domain *domain);
|
|
|
|
/* Per device platform MSI */
|
|
int platform_device_msi_init_and_alloc_irqs(struct device *dev, unsigned int nvec,
|
|
irq_write_msi_msg_t write_msi_msg);
|
|
void platform_device_msi_free_irqs_all(struct device *dev);
|
|
|
|
bool msi_device_has_isolated_msi(struct device *dev);
|
|
|
|
static inline int msi_domain_alloc_irqs(struct device *dev, unsigned int domid, int nirqs)
|
|
{
|
|
return msi_domain_alloc_irqs_range(dev, domid, 0, nirqs - 1);
|
|
}
|
|
|
|
#else /* CONFIG_GENERIC_MSI_IRQ */
|
|
static inline bool msi_device_has_isolated_msi(struct device *dev)
|
|
{
|
|
/*
|
|
* Arguably if the platform does not enable MSI support then it has
|
|
* "isolated MSI", as an interrupt controller that cannot receive MSIs
|
|
* is inherently isolated by our definition. The default definition for
|
|
* arch_is_isolated_msi() is conservative and returns false anyhow.
|
|
*/
|
|
return arch_is_isolated_msi();
|
|
}
|
|
#endif /* CONFIG_GENERIC_MSI_IRQ */
|
|
|
|
/* PCI specific interfaces */
|
|
#ifdef CONFIG_PCI_MSI
|
|
struct pci_dev *msi_desc_to_pci_dev(struct msi_desc *desc);
|
|
void pci_write_msi_msg(unsigned int irq, struct msi_msg *msg);
|
|
void __pci_read_msi_msg(struct msi_desc *entry, struct msi_msg *msg);
|
|
void __pci_write_msi_msg(struct msi_desc *entry, struct msi_msg *msg);
|
|
void pci_msi_mask_irq(struct irq_data *data);
|
|
void pci_msi_unmask_irq(struct irq_data *data);
|
|
struct irq_domain *pci_msi_create_irq_domain(struct fwnode_handle *fwnode,
|
|
struct msi_domain_info *info,
|
|
struct irq_domain *parent);
|
|
u32 pci_msi_domain_get_msi_rid(struct irq_domain *domain, struct pci_dev *pdev);
|
|
struct irq_domain *pci_msi_get_device_domain(struct pci_dev *pdev);
|
|
void pci_msix_prepare_desc(struct irq_domain *domain, msi_alloc_info_t *arg,
|
|
struct msi_desc *desc);
|
|
#else /* CONFIG_PCI_MSI */
|
|
static inline struct irq_domain *pci_msi_get_device_domain(struct pci_dev *pdev)
|
|
{
|
|
return NULL;
|
|
}
|
|
static inline void pci_write_msi_msg(unsigned int irq, struct msi_msg *msg) { }
|
|
#endif /* !CONFIG_PCI_MSI */
|
|
|
|
#endif /* LINUX_MSI_H */
|