Known issues: MLNX_EN v5.8-1.1.2.1 LTS (2023)

The following is a list of general limitations and known issues for the current release version. For a list of previous known issues, see the NVIDIA Archived Known Issues file at:http://www.mellanox.com/pdf/prod_software/MLNX_EN_Archived_Known_Issues.pdf

Referee. Intern numbercompany

3247519

Description:On an Ubuntu 22.04 system, when installing using the apt installation method to install MLNX_OFED, including Open vSwitch, and if the Open vSwitch distribution package was previously installed, the installation may fail due to a residual file generated by systemd: the /etc symbolic link /systemd/ system/openvswitch-switch.service.requires/ovs-record-hostname.service -> /lib/systemd/system/ovs-record-hostname.service .
Keywords:Installation, Ubuntu 22.04, open vSwitch

Workaround:Remove (and clean: apt purge / dpkg --purge) the openvswitch-switch package before installing, or if you encounter this error message, remove the remaining file and continue with the installation:
rm-Datei /etc/systemd/system/openvswitch-switch.service.requires/ovs-record-hostname.service
apt install -f

version fixed: 5,8-1.0.1.1

3191223

Description:in old cores,/etc/init.d/openibd stopwill fail due to an existing TC rule. Since mlx5_ib is already unloaded, mlx5_core and mlx5_ib are in an inconsistent state.
Keywords:as soon as possible2, eSwitch, TC Rules

Workaround:Set eSwitch mode to Legacy before enabling SR-IOV or reload mlx5_core to change eSwitch mode to Legacy.

version fixed: 5,8-1.0.1.1

3199628

Description:ping -6 -i <interface name>it is broken in v5.18.
Keywords:NetDev, Flag -i
Workaround:On all operating systems running kernel 5.18 and earlier, remove the -i flag.

version fixed: 5,8-1.0.1.1

3002932

Description:Jumbo MTU must always be set on all uplinks (ie *_sf and *_sf_r uplinks).
Keywords:NetDev, MTU, Uplink

Workaround:Set the jumbo MTU (9216) on all interfaces related to the uplink.

version fixed: 5,8-1.0.1.1

3228788

Description:Users are advised not to run rx-tls-offload on top of Korg6.0 as the TLS module will not work properly.
Keywords:Network development, TLS

Workaround:N / D

version fixed: 5,8-1.0.1.1

3130859

Description:The yum installation method can be stopped in the newly generated installer with--add-kernel-support-construir-solo.
Keywords:Installation, hm

Workaround:Remove the original mlnx-ofed-all-5.* package and recreate the repository with: createrepo RPMS/

version fixed: 5,8-1.0.1.1

3149387

Description:The neohost-backend package (included in MLNX_OFED) strictly depends on Python 2.7 and the existence of Python/usr/bin/python. This dependency is due to a pre-install test (which is not a standard practice) so /usr/bin/python fails to install if Python 2.7 is not present.

For this reason, the default installation has been disabled on newer systems that do not have Python 2 installed by default.

If there is an explicit requirement for this installation, use the command line option--con-back-end from neohost, this sanity check will be overridden and an attempt will be made to install it independently. May not be on newer systems/usr/bin/pythoneven if Python 2 is installed; Therefore, your installation fails.

Keywords:Installation, Python 2

Workaround:If neohost-backend is needed on a newer system, install Python 2 in advance and create the symlink/usr/bin/python-> Python2.

version fixed: 5,8-1.0.1.1

3213777

Description:Oracle Enterprise Linux Version 9.0 generates kernel module packages with dependencies not provided by their own kernel RPM packages and therefore cannot be installed.
Keywords:Installation, Oracle Enterprise Linux v9.0

Workaround:N / D

version fixed: 5,8-1.0.1.1

3229904

Description:The reboot driver fails to load OFED modules after installing OFED on SLES15sp4 with the default errata kernel 5.14.21-150400.24.21.
Keywords:Installation

Workaround:Install OFED with--add-kernel-supportBandera.

version fixed: 5,8-1.0.1.1

3189424

Description:VLAN naming (like all other interface names) is limited to 16 characters. For names longer than 16 characters, the kernel generates its own VLAN interface name (VID).
Keywords:Core, VLAN, Interface Name

Workaround:Choose a name that meets the 16-character limit.

version fixed: 5,8-1.0.1.1

3220855

Description:Creating external SFs on BF ARM when the host OS (x86) does not support SFs may cause the host to fail.
Keywords:Essential and scalable features

Workaround:N / D

version fixed: 5,8-1.0.1.1

3235097

Description: iIn some topologies, e.g. B. logical partitions, mlxfwreset is not supported.
Keywords:core, mlxfwreset

Workaround:N / D

version fixed: 5,8-1.0.1.1

Referee. Intern numbercompany

3114823

Description:The first attempt to establish a new iSER connection fails with the following messages in dmesg:

No iSCSI login timeout in network portal <iSER_Target_IP_ADDR>:3260
isert: isert_get_login_rx: isert_conn 00000000e9239d52 Aborted before login request was received

After the failure, the iSER initiator successfully connects to the target, but the memory allocated for the first connection is not freed correctly. Therefore, the failed attempt also causes a memory leak.

  • Kernel.orgKern 5.18
  • RHEL 9.0
  • RHEL 8.6
  • Ubuntu 22.04
  • SLES 15 SP4

The error occurs due to a bug in the scsi_transport_iscsi module, which is not part of

MLNX_ES. Therefore, the problem in MLNX_EN cannot be fixed.

The bug was fixed in kernel 5.19 by commit f6eed15f3ea7 ("scsi: iscsi: Exclude zero from endpoint ID range").

Workaround:Update the kernel if you encounter the above errors. If the problem persists after the kernel update, ask your distribution support to apply the kernel bugfix above.
Keywords:iSER Initiator

Discovered at startup: 5.7-1.0.2.0

3096911

Description:Installing chkconfig on Rhel9.0 with OFED using yum failed (chkconfig creates a symlink /etc/init.d and OFED creates files in that directory, causing a conflict).

Workaround:Installing chkconfig before OFED.

Keywords:Installation

Discovered at startup: 5.7-1.0.2.0

3100544

Description:On a RHEL9.x system, in some cases where the modules in the inbox don't match the drivers to build, rebuilding the drivers (--add-kernel-support) works, but the built-in package doesn't get installed, with many Errors like: kernel( __rdma_block_iter_next) = 0x8e7528da is required for mlnx-ofa_kernel-modules-5.6-OFED.5.6.2.0.9.1.kver.5.14.0_70.13.1.el9_0.aarch64.aarch64

This was caused by a bug in the scripts that create the Requires and Provides headers, which get confused by dependencies between different modules in the same external package.

Workaround:dnf install kernel-modules-<kernel-version> # if not the latest.

Keywords:Installation, RHEL9.x

Discovered at startup: 5.7-1.0.2.0

3132158

Description:Compiling the rdma-core package on the Rocky 8.6 OS caused the OFED compilation to fail.

Workaround:N / D

Keywords:Installation

Discovered at startup: 5.7-1.0.2.0

3137440

Description:Python package is missing, you have to install it manually.

Workaround:Install Python before starting the build.

Keywords:Installation, Python

Discovered at startup: 5.7-1.0.2.0

3141506

Description:The kernel-macros package does not support building with KMP enabled. KMP must be disabled.

Workaround:Build and install MOFED with KMP disabled (no --kmp flag).

Keywords:Installation

Discovered at startup: 5.7-1.0.2.0

3141506

Description:The kernel-macros package does not support building with KMP enabled. KMP must be disabled.

Workaround:Build and install MOFED with KMP disabled (no --kmp flag).

Keywords:Installation

Discovered at startup: 5.7-1.0.2.0

3129627

Description:Kernel module packaging is not compatible with CtyunOS.

Workaround:N / D

Keywords:Installation

Discovered at startup: 5.7-1.0.2.0

2971708

Description:For operating systems where Devlink supports rub-enable/disable setting, sysfs rub_enable show and sysfs rub_enable set are disabled and RoCE status should be managed exclusively through Devlink.

The sysfs rub-enable/disable interface will be completely removed for these operating systems in a future release.
To determine if Devlink can be used to enable or disable RoCE, after launching OFED, run the following console command:

devlink dev parameter demo | Tau grep


Devlink supports enabling/disabling rubbing when the following line is rendered in the output:

name enable_rub generischer Typ

For operating systems that don't allow enabling/disabling RoCE via Devlink, the sysfs interface behaves like the previous 2 versions:

  1. For operating systems that have Devlink Reload but do not allow setting RoCE status via Devlink:
    sysfs rub_enable show works the same as sysfs rub_enable set but devlink reload must be done through sysfs after setting RoCE state to enable desired rub state.
  2. On operating systems without a devlink reload, RoCE status is only managed through the sysfs interface.
    'show' displays the RoCE status and 'set' sets and activates the status.
    Run the following console command (using the bash shell) to determine if Devlink dev reload is supported:

    devlink developer help 2>&1 | reload grep

    Reloading is supported when the output is:

    devlink dev reload DEV [ netns { PID | NAME | I WOULD GO } ]

Workaround:N / D

Keywords:Enable/disable RoCE

Discovered at startup: 5.7-1.0.2.0

Referee. Intern numbercompany

2971708

Description:For operating systems where Devlink supports rub-enable/disable setting, sysfs rub_enable show and sysfs rub_enable set are disabled and RoCE status should be managed exclusively through Devlink.

The sysfs rub-enable/disable interface will be completely removed for these operating systems in a future release.
To determine if Devlink can be used to enable or disable RoCE, after launching OFED, run the following console command:

devlink dev parameter demo | Tau grep


Devlink supports enabling/disabling rubbing when the following line is rendered in the output:

name enable_rub generischer Typ

For operating systems that don't allow enabling/disabling RoCE via Devlink, the sysfs interface behaves like the previous 2 versions:

  1. For operating systems that have Devlink Reload but do not allow setting RoCE status via Devlink:
    sysfs rub_enable show works the same as sysfs rub_enable set but devlink reload must be done through sysfs after setting RoCE state to enable desired rub state.
  2. On operating systems without a devlink reload, RoCE status is only managed through the sysfs interface.
    'show' displays the RoCE status and 'set' sets and activates the status.
    Run the following console command (using the bash shell) to determine if Devlink dev reload is supported:

    devlink developer help 2>&1 | reload grep

    Reloading is supported when the output is:

    devlink dev reload DEV [ netns { PID | NAME | I WOULD GO } ]

Workaround:N / D

Keywords:Enable/disable RoCE

Discovered at startup: 5.7-1.0.2.0

2998194

Description:On some systems with many (e.g. 64) virtual functions (VFs) connected to a ConnectX interface, 'ip link' can generate an error message: "Error: Buffer too small for object". This applies to both IP commands: the iproute inbox package on RHEL8.x and the mlnx-iproute2 package on MLNX_OFED.

This is known to work fine and not fail on the RHEL7.x kernel, regardless of the userland package used (including the RHEL8.x userland).

Workaround:N / D

Keywords:NetDev, RHEL, virtual functions

Discovered at startup: 5.6-1.0.3.5

3040350

Description:
  1. When offloading is enabled, removing a physical port from the ovs-dpdk bridge requires a restart of the OVS service. Otherwise, the data path rules will be misconfigured.
  2. When offloading is enabled, the physical port must be connected to a bridge.

Workaround:

  1. Removing a physical port from an ovs-dpdk bridge with offloading enabled requires a reboot of the openv switch after bringing it back online.
  2. Bridge the physical port according to the desired topology.
Keywords:OVS-DPDK, Bridge, Download

Discovered at startup: 5.6-1.0.3.5

2973726

Description:dec_ttl only works with ConnectX-6. Does not work with ConnectX-5.

Workaround:N / D

Keywords:OVS-DPDK, dez_ttl

Discovered at startup: 5.6-1.0.3.5

2946873

Description:Switching to switchdev mode during namespace removal can cause a deadlock.

Workaround:Download the mlx5_ib module before going into switchdev mode.

Keywords:as soon as possible2, switchdev, Namensraum

Discovered at startup: 5.6-1.0.3.5

2811957

Description:If a system is running from a network boot and is connected to network storage via an NVIDIA ConnectX card, downloading the mlx5_core driver (e.g. by running /etc/init.d/openibd restart) will restart the system make it unusable and is therefore avoided.

Workaround:N / D

Keywords:Installation, mlx5_core

Discovered at startup: 5.6-1.0.3.5

2979243

Description:The CentOS 7.6alt kernel (for non-x86 architectures) differs from RHEL 7.6alt. Some of the MLNX_OFED kernel modules built for the RHEL7.6alt kernel will not load on a system with the Centos7.6alt kernel. If you want to install MLNX_OFED on such a system, you must use ./mlnxofedinstall --add-kernelsupport to rebuild the kernel modules for the Centos kernel.

Workaround:Use the kernel support plugin.

Keywords:Installation, CentOS

Discovered at startup: 5.6-1.0.3.5

3011440

Description:On Debian 11.2, Ubuntu 21.10 and Ubuntu 22.04, attempting to install an "exact" metapackage type (such as mlnx-ofed-all-exact or mlnx-ofed-basic-exact) may fail with an mstflint version-related error.

Workaround:Also, install mstflint in the exact same version (e.g. apt install mlnx-ofed-all-exact mstflint=4.16.0-1.56xxxx).

Keywords:Installation, Debian, Ubuntu, MST

Discovered at startup: 5.6-1.0.3.5

3024520

Description:The --copy-ifnames-udev option copies some files to /etc (/etc/udev/rules.d/82-net-setup-link.rules and /etc/infiniband/vf-net-link-name.sh) which are never removed, not if that option is not provided and not at uninstall. These scripts are just examples. These are files in /etc that must be maintained by the user.

Workaround:If necessary, delete files.

Keywords:Installation

Discovered at startup: 5.6-1.0.3.5

3046601

Description:Rebuilding kernel modules (--add-kernel-support) for some kernel versions (mainline 4.14 in particular) did not properly disable LDFLAGS. The xpmem rebuild may fail in this case with an error like "unrecognized option '-Wl,-z,relro'" in the xpmem build log.

Workaround:Disable xpmem compilation by adding --sin-xpmem to the command line, or edit the kernel makefile to deconfigure LDFLAGS:

sed -i -e '/^exportar ARCH/iLDFLAGS :=' /lib/modules/$(uname -r)/Makefile

Monitoring:The makefile can be located elsewhere, for example in the top-level directory of the kernel source directory.

Keywords:Installation, SLES

Discovered at startup: 5.6-1.0.3.5

3046655

Description:A package manager update with zypper (on a SLES system) may raise a vendor change issue from "Mellanox Technologies" to "OpenFabrics".

Workaround:Accept when prompted or add the file /etc/zypp/vendors.d/mlnx_ofed with the following content:

[main]fornecedores = Mellanox,OpenFabrics
Keywords:Installation, SLES

Discovered at startup: 5.6-1.0.3.5

3048411

Description:After installing OFED with modified kernel modules, error messages appear stating that the mlx5_ib kernel module could not be loaded (eg "mlx5_ib: unknown symbol..."). These messages can safely be ignored as the module will eventually load.
Workaround:Run the dracut -f command to update the initramfs.
Keywords:Installation

Discovered at startup: 5.6-1.0.3.5

3048444

Description:OFED installation failed when using yum for the --add-kernel-support option (packaging without KMP enabled) when the libfabric package is installed.
Workaround:Remove the libfabric package before installing OFED or use the installation script.
Keywords:Installation, RHEL 8.5

Discovered at startup: 5.6-1.0.3.5

3015210

Description:The OVS topology, where the tunnel device is on a VF and the VF delegate is connected to a link, is not supported.
Workaround:N / D
Keywords:as soon as possible2, ConnectX-6 Dx, tunnel over VF, LAG, connection tracing

Discovered at startup: 5.6-1.0.3.5

3028300

Description:OVS measurement is not supported in kernel 5.17.
Workaround:N / D
Keywords:as soon as possible2,OVS, Meter, Core 5.17

Discovered at startup: 5.6-1.0.3.5

3044255

Description:Destroying mlxdevm group while SF is mounted is not supported.
Workaround:N / D
Keywords:as soon as possible2, mlxdevm, QoS, group, scalable features, ConnectX-6 Dx

Discovered at startup: 5.6-1.0.3.5

3046456

Description:Fast switching between SwitchDev mode and Legacy mode in BlueField-2 may prevent the driver from loading correctly and break driver state recovery.
Workaround:Wait 60 seconds between state change commands to ensure driver integrity recovery completes successfully.
Keywords:as soon as possible2, BlueField-2, restore health

Discovered at startup: 5.6-1.0.3.5

2934149

Description:Adding vDPA ports on ConnectX 5 devices in ovs-dpdk is not supported and will fail.
Workaround:N / D
Keywords:OVS-DPDK, ConnectX-5

Discovered at startup: 5.6-1.0.3.5

2901514

Description:Relaxed ordering doesn't work properly in virtual functions.
Workaround:N / D
Keywords:Relaxed request, VF

Discovered at startup: 5.6-1.0.3.5

Referee. Intern numbercompany

2688191

Description:The lower Tx rate limit is not supported at a connection speed of 1 Gb/s.
Workaround:N / D
Keywords:Speed ​​limit, 1 Gb/s

Discovered at startup:5.4-1.0.3.0

2870299

Description:SF management is only possible with the iproute2 tool using mlxdevm.
Workaround:N / D
Keywords:scalable functions

Discovered at startup:5.5-1.0.3.2

2869722

Description:OFED packages were compiled with DKMS disabled because OFED compilation with DKMS failed due to a DKMS package issue in UOS. The --dkms flag should not be used.
Workaround:N / D
Keywords:Installation, DKMS

Discovered at startup:5.5-1.0.3.2

2851639

Description:Enabling ARFS in legacy mode and then switching to switchdev mode is not supported and may result in undesirable behavior.
Workaround:N / D
Keywords:Network Development, ARFS

Discovered at startup:5.5-1.0.3.2

2851639

Description:nvme and iser are not enabled on UOS ARM due to incompatibility with the UOS kernel.
Workaround:N / D
Keywords:nvme, iser, armuos

Discovered at startup:5.5-1.0.3.2

2860855

Description:OFED compilation on RHEL 8.4 with kmp disabled and installing with yum fails due to some conflicting packages.
Workaround:Remove the libfabric and librpmem packages before installing OFED, or add the --allowerasing option to the installation command.
Keywords:Installation, RHEL 8.4, kmp, hmm

Discovered at startup:5.5-1.0.3.2

2865983

Description:OFED packages were built with kmp disabled. Compilation with kmp enabled fails due to missing packages.
Workaround:N / D
Keywords:Installation, km

Discovered at startup:5.5-1.0.3.2

Referee. Intern numbercompany

2658644

Description:Only the lower 32-bit match of ct_label is supported.
Workaround:N / D
Keywords:as soon as possible2, connection tracking

Discovered at startup:5.4-1.0.3.0

2706345

Description:The number of RQ and TIR allocations in the controller depends on the total number of allocated MSI-X vectors. The total number of IRRs supported by the device is 16,000. Each rep requires a number of CPUs per IRR, up to a maximum of 128.
Workaround:To use a large number of VFs, set PF_NUM_PF_MSIX to a smaller value, around 32.
Keywords:as soon as possible2,VF, PF_NUM_PF_MSIX

Discovered at startup:5.4-1.0.3.0

2836997

Description:A self-test that verifies that the flow meter's rate variation stays within a fixed limit (e.g. 10%) may fail because the accuracy of the meter depends on several factors (i.e. the rate and peak readings and the shape of the meter) . . of the counter). Traffic).

To choose the best configuration parameters for a flow meter, run a few test measurements with different burst size values ​​versus the expected traffic load and average the results over a long period of time (tens of minutes).

Workaround:N / D
Keywords:as soon as possible2, meter limit

Discovered at startup:5.4-1.0.3.0

2863456

Description:SA limit per packet count (hard and soft) is only supported for traffic originating from ECPF. Attempting to set them to VF traffic will drop the SA when the hard limit is reached; However, due to the tunneling shift used in such a configuration, traffic can still pass in clear text.
Workaround:N / D
Keywords:as soon as possible2, Full IPsec Download

Discovered at startup:5.4-0.5.1.1

2657392

Description:Installing OFED caused CIFS to fail on RHEL8.4 and RHEL8.5. Added dummy module to disable CIFS after installing OFED on RHEL8.4 and RHEL8.5.
Workaround:N / D
Keywords:Installation, RHEL8.4, RHEL8.5, CIFS

Discovered at startup:5.4-0.5.1.1

2800993

Description:OpenMPI does not support running on other operating systems and/or CPU architectures.
Workaround:N / D
Keywords: OpenMPI

2399503

Description:Öpen vSwitch is not compatible with newer operating systems that only support Python3.
Workaround:N / D
Keywords:Python, ochange pen

2657392

Description:Installing OFED caused CIFS to fail on RHEL8.4. Added dummy module to disable CIFS after installing OFED on RHEL8.4.
Workaround:N / D
Keywords:Installation, RHEL8.4, CIFS

Discovered at startup:5.4-0.5.1.1

2782406

Description:Running yum update updates kylin-release to a higher version. This package version is used for kylin10sp2 detection, so the script detects kylin 10 instead of kylin10sp2 and accidentally uses its repository.
Workaround:Since there are no special cases for kylin10sp2, you can use the detected repository by adding --add-kernel-support to the installation command.
Keywords:Update, kylin

Discovered at startup:5.4-3.0.3.0

2755632

Description:For dual port cards with SR-IOV, if one port link is configured for InfiniBand and the other port link is configured for Ethernet, the Ethernet port cannot support VST and QinQ.
Workaround:N / D
Keywords:SR-IOV, VST, QinQ

Discovered at startup:5.4-3.0.3.0

2780436

Description:A non-default MTU (>1500) does not support IPsec encryption offload and may result in packet loss.
Workaround:N / D
Keywords:IPsec, offload encryption, MTU

Discovered at startup:5.4-3.0.3.0

2726021

Description:Building packages in openEuler with kmp enabled requires the kernel-rpm-macros package to be installed. kernel-rpm-macros-30-13.oe1 does not support the -p option and kernel-rpm-macros-30-18.oe1 must be installed instead.
On the Kylin OS, the kernel-rpm-macros package version does not support the -p option required to support kmp, so it is left disabled.
Workaround:N / D
Keyword: meInstallation, openEuler

Discovered at startup:5.4-3.0.3.0

Referee. Intern numbercompany

2750653

Description:Running fragmented traffic on RHEL 8.3 (4.18.0-240.el8.x86_64) may cause trace calls to build_skb.
Workaround:Update for RHEL 8.3 z-stream 4.18.0-240.22.1.el8_3.x86_64.
Keywords:RHEL 8.3, Kernel Panic, Anrufverfolgung, fr

Discovered at startup:5.4-1.0.3.0

2629375

Description:Matching on the CT tag is only supported when matching on the lower 32 bits. A full match on all 128 bits of the CT tag is not supported.
Workaround:N / D
Keywords:as soon as possible2, Connection Trace Label

Discovered at startup:5.4-1.0.3.0

2707997

Description:Installing in package manager mode on SLES 15.x may require user intervention if the original libibverbs are installed.
Workaround:zypper install --force-resolver mlnx-ofed-all
Keywords:Installation, libibverbs

Discovered at startup:5.4-1.0.3.0

2708531

Description:Installing in package manager mode on SLES 15.x may require user intervention if the original libopenvswitch is installed.
Workaround:zypper install --force-resolver mlnx-ofed-all
Keywords:Installation

Discovered at startup:5.4-1.0.3.0

2703043

Description:Congested TCP blocking for download traffic from kTLS TX devices impacts performance.
Workaround:Disable TCP selective acknowledgment: echo 0 > /proc/sys/net/ipv4/tcp_sack
Keywords:kTLS-TX

Discovered at startup:5.4-1.0.3.0

2676405

Description:If the interface rename package is active (e.g. on XenServer), OFED interface renaming is not performed to eliminate conflicts.
Workaround:N / D
Keywords:Interface name change

Discovered at startup:5.4-1.0.3.0

2687943

Description:Offloading rules that redirect from VFs on one PF to VFs on the second PF are not supported on directly attached devices.

Workaround:N / D

Keywords:as soon as possible2, direct socket

Discovered at startup:5.4-1.0.3.0

2678672

Description:Disabling switchdev mode keeps the qdisc on the tunnel device from being destroyed and OVS keeps calling mlx5e_stats_flower() resulting in a NULL pointer panic and a memory leak.

Workaround:N / D

Keywords:SwitchDev, mlx5, Tunnelverkehr

Discovered at startup:5.4-1.0.3.0

2566548

Description:When EEH is enabled on PPC systems, when performing the FW sync reset (using mlxfwreset with the --sync flag 1 or using the devlink dev fw_activate reload action), the EEH can capture the PCI reset and start the transfer in take possession. In the case of multiple executions in a row, the EEH can also decide to deactivate the device.

Workaround:The administrator can disable EEH before resetting firmware sync on the device.

Keywords:PCP, EEH

Discovered at startup:5.4-1.0.3.0

2617950

Description:The TX port timestamp feature is supported in kernel versions 3.15 and later. In earlier kernel versions, the function is not supported and the ptp_tx<X>_* counters are not increased.

Workaround:N / D

Keywords:Ettool

Discovered at startup:5.4-1.0.3.0

2390731

Description:Ethtool doesn't show advertised port speeds/capacity over 100Gb/s over and under 5.0 kernels even if it's supported.

Workaround:N / D

Keywords:Ethtool port speed

Discovered at startup:5.4-1.0.3.0

Referee. Intern numbercompany

2585575

Description:After disabling sync reset by setting enable_remote_dev_reset to false, performing firmware sync reset can sometimes cause a General Protection Fault and the system may hang.

Workaround:N / D

Keywords:Firmware Update

Discovered at startup:5.3-1.0.0.1

2582565

Description:Performing a firmware reset or unbinding PF in switchdev mode can cause a kernel crash.

Workaround:N / D

Keywords:SwitchDev, as soon as possible2, Unpair, reset firmware

Discovered at startup:5.3-1.0.0.1

2587802

Description:PTP synchronization can be lost when using the private flag tx_port_ts.

Workaround:Toggle private flag:
ethtool --set-priv-flags <si> tx_port_ts disabled
ethtool --set-priv-flags <si> tx_port_ts de
Restart the ptp4l application

Keywords:PTP-Synchronisation

Discovered at startup:5.3-1.0.0.1

2574943

Description:When running kernel 5.8 and earlier or RHEL 8.2 and earlier, the sample packages do not support tunnel information.
Workaround:N / D
Keywords:as soon as possible2, sFLUSS

Discovered at startup: 5.3-1.0.0.1

2568417

Description:After upgrading to version 5.3, the package manager tool installs new packages and removes old packages, a depmod WARNING appears in mlx5_fpga_tools. This warning can safely be ignored. mlx5_fpga_tools is a module that existed in version 5.2 and was removed in 5.3.
Workaround:N / D
Keywords:Melhoria; mlx5_fpga_tools

Discovered at startup: 5.3-1.0.0.1

2506425

Description:When installing kmod packages on EulerOS 2.0SP9 or OpenEuler 20.03 the following error appears: "modprobe: FATAL: not to get mod versions from <directory>". This error can safely be ignored. It is caused by incorrectly adding directories to a list of modules managed by /usr/sbin/weak-modules.
Workaround:N / D
Keywords:Furnishings; modules; kmod

Discovered at startup: 5.3-1.0.0.1

2492509

Description:When installing the driver on OpenEuler or EulerOS 2.0SP9, rebuilding the drivers (--add-kernel-support) with the --kmp option (to create kmod packages) results in packages that cannot be installed because they from " /sbin / depmod ", which the system does not provide. This dependency is created by a buggy kmod package generation tool included with the distribution.
Workaround:N / D
Keywords:Plugin kernel support

Discovered at startup: 5.3-1.0.0.1

2479327

Description:If the kernel has been updated to 4.12.14-122.46 on SLES 12 SP5, it is not possible to rebuild kernel modules (--add-kernel-support) without updating gcc to at least 4.8.5-31.23.2 .
Workaround:N / D
Keywords:Improvement; LES 12; Add-on core support

Discovered at startup: 5.3-1.0.0.1

2584441

Description:If the kernel has been updated to 4.12.14-122.46 on SLES 12 SP5, it is not possible to rebuild kernel modules (--add-kernel-support) without updating gcc to at least 4.8.5-31.23.2 .
Workaround:N / D
Keywords:Improvement; LES 12; Add-on core support

Discovered at startup: 5.3-1.0.0.1

2460865

Description:By setting the MTU to low values, such as 68 bytes, packets may fail due to oversize.
Workaround:N / D
Keywords:PERSONA

Discovered at startup: 5.3-1.0.0.1

2383318

Description:On RedHat 7.2-based kernels, the tx_port_ts feature as defined by ethtool – set-priv-flags is disabled.
Workaround:N / D
Keywords:Roter Hut; tx_port_ts

Discovered at startup: 5.3-1.0.0.1

2575647

Description:OvS-DPDK errors can occur during live migration for virtual machines using Virtio interfaces accelerated with OvS-DPDK vDPA ports.
Workaround:N / D
Keywords:OVS-DPDK vDPA, Live-Migration

Discovered at startup: 5.3-1.0.0.1

Referee. Intern numbercompany

2395082

Description:When switching from SwitchDev mode to Legacy mode in kernel v5.9, there may be call tracking due to a kernel issue in tcf_block_unbind.
Workaround:N / D
Keywords:as soon as possible2;ChangeDev; Anrufverfolgung; Ader; tcf_block_unbind

Discovered at startup: 5.2-1.0.4.0

Referee. Intern numbercompany

2209987

Description:The aRFS feature (enabled with ethtool ntuple on) is disabled for kernel 4.1 or earlier.
Workaround:N / D
Keywords:aRFS

Discovered at startup: 5.1-1.0.4.0

2248996

Description:Firmware version downgrade for ConnectX-6 cards with "installar --fw-update-only --force-fw-update"Cases.
Workaround:Manually downgrade the firmware version - seeFirmware update instructions.
Keywords:Firmware, ConnectX-6

Discovered at startup:5.1-1.0.4.0

2175930

Description:When using MLNX_ENv5.1 on PPC architectures with v5.5 or v5.6 kernel and an older ethtool utility, a harmless warning call trace may appear in dmesg due to user kernel incompatibility. Notification call tracking mentions ethtool_notify.
Workaround:Update the ethtool utility to version 5.6 on these systems to prevent call tracking.
Keywords:ppc,ethtool_notify, Kernel

Discovered at startup:5.1-1.0.4.0

2198764

Description:If MLNX_EN is installed on a Debian or Ubuntu system running in a chroot environment, the openibd service will not be activated. If chroot files are used as the basis of a full system, the openibd service will be disabled.
Workaround:Openibd is currently a sysv-init script that you can manually enable by running:update-rc.d openibd-Standards
Keywords:Chroot, Debian, Ubuntu, openibd

Discovered at startup:5.1-1.0.4.0

2237134

Description:Running Connection Trace (CT) on the FW address may cause the CREATE_FLOW_TABLE command to fail with the syndrome.

Workaround:Configure OVS to use a single controller thread:

#ovs-vsctl setze Open_vSwitch. other_config:n-handler-threads=1

Keywords:Connection tracking, ASAP, OVS, FW address

Discovered at startup:5.1-1.0.4.0

2239894

Description:Running the OpenVSwitch offload with high traffic throughput can result in a low insert rate due to high CPU usage.
Workaround:Reduce the number of combined uplink channels with "ethtool -L".
Keywords:Insertion fee, ASAP2

Discovered at startup:5.1-1.0.4.0

2240671

Description:The header rewrite action is not supported on RHEL/CentOS 7.4.
Workaround:N / D
Keywords:ASAP, Header Rewrite, RHEL, RedHat, CentOS, OS

Discovered at startup:5.1-1.0.4.0

2242546

Description:Tunnel download (encap/decap) can cause a kernel panic if the nf_tables module is not tested.
Workaround:Make sure you check the nf_tables module before entering rules.
Keywords:Kernel v5.7, ASAP, Kernel-Panic

Discovered at startup:5.1-1.0.4.0

2143007

Description:IPsec packets are dropped during heavy traffic due to a bug in the Linux kernel net/xfrm.
Workaround:Make sure the kernel is modified to apply the following patch: "xfrm: fixed double insertion of ESP trailer when downloading IPsec encryption".
Keywords:IPsec, xfrm

Discovered at startup:5.1-1.0.4.0

2225952

Description:VF mirroring with TC skip_sw policy is not supported on RHEL/CentOS 7.4, 7.5 and 7.6 operating systems.
Workaround:N / D
Keywords:as soon as possible2, Mirroring, RHEL, RedHat, OS

Discovered at startup:5.1-1.0.4.0

2216521

Description:After upgrading MLNX_EN from v5.0 or earlier, the ibdev2netdev utility changes the installation prefix to /usr/sbin. Therefore, it cannot be found in the same shell environment.
Workaround:AfterInstall MLNX_EN, log out and back in to update the SHELL environment.
Keywords:ibdev2netdev

Discovered at startup:5.1-1.0.4.0

2202520

Description:Rules that combine VLAN push/drop, encapsulate/decapsulate, and header rewrite actions are not supported.
Workaround:N / D
Keywords:as soon as possible2, SwitchDev, VLAN-Push/Pop, Encap/Decap, Header-Umschreibung

Discovered at startup:5.1-1.0.4.0

2210752

Description:Switching from Legacy mode to SwitchDev mode and vice versa while TC rules are present on the NIC fails.
Workaround:Before trying to switch modes, make sure you remove any TC rules on the NIC or stop OpenvSwitch.
Keywords:as soon as possible2, Devlink, SR-IOV-Heredado

Discovered at startup:5.1-1.0.4.0

2125036/2125031

Description:Upgrading MLNX_EN from a UPSTREAM_LIBS based version to a MLNX_LIBS based version will fail unless the driver is uninstalled and reinstalled.
Workaround:Be sure to uninstall and reinstall MLNX_EN to complete the upgrade.
Keywords:Installation , UPSTREAM_LIBS , MLNX_LIBS

Discovered at startup:5.1-1.0.4.0

2105447

Description:Warning messages from hns_roce appear in dmesg after reboot on Euler2 SP3 operating systems.
Workaround: N / D
Keywords:hns_years, dmesg, Euler

Discovered at startup:5.1-1.0.4.0

2112251

Description:In kernel 4.10-4.14, when the far end of the Geneve tunnel is defined with IPv6, packets larger than MTU are not fragmented, resulting in no traffic being sent.

Workaround:Set up the far end of the Geneve tunnel with IPv4.
Keywords:Kernel, Geneva, IPv4, IPv6, MTU, Fragmentation

Discovered at startup:5.1-1.0.4.0

2102902

Description:On the RH8.0-4.18.0-80.el8.x86_64 operating system, a kernel panic may occur when opening the kTLS offload connection due to a bug in the kernel TLS stack.
Workaround:N / D
Keywords:Download TLS, mlx5e

Discovered at startup:5.1-1.0.4.0

2111534

Description:On the Ubuntu19.04-5.0.0-38-generic operating system, a kernel panic may occur when opening the kTLS offload connection due to a bug in the kernel TLS stack.
Workaround:N / D
Keywords:Download TLS, mlx5e

Discovered at startup:5.1-1.0.4.0

Referee. Intern numbercompany

2094176

Description:At full scale in VF-LAG mode, the bandwidth can be unstable.
Workaround:N / D
Keywords:VF DELAY

Discovered at startup:5.0-1.0.0.0

2044544

Description:When working with an operating system with kernel v4.10, the binding engine does not support configuring MTUs greater than 1500 on a binding interface.
Workaround:Please update your kernel version to v4.11 or higher.
Keywords:Link, PERSON, Core

Discovered at startup:5.0-1.0.0.0

1882932

Description:The libibverbs dependencies are removed during OFED installation, which requires manual installation of libraries, which OFED does not reinstall.
Workaround:Install the missing packages manually.
Keywords:libibverbs, Installation

Discovered at startup:5.0-1.0.0.0

2058535

Description:The ibdev2netdev command returns duplicate devices with different ports in SwitchDev mode.
Workaround:Use the /opt/mellanox/iproute2/sbin/rdma link show command instead.
Keywords:ibdev2netdev

Discovered at startup:5.0-1.0.0.0

2072568

Description:no RHEL/CentOS 7.2 operating systems, add delete rules whenact_gactnot loaded can cause a kernel crash.
Workaround:Preload all necessary modules to avoid such a scenario (cls_flower, act_mirred, act_gact, act_tunnel_key and act_vlan).
Keywords:RHEL/CentOS 7.2, kernel 4.9, call trace, ASAP

Discovered at startup:5.0-1.0.0.0

2093698

Description:The VF LAG setting is not supported if the NUM_OF_VFS configured in mlxconfig is greater than 64.
Workaround:N / D
Keywords:VF LAG, SwitchDev mode, ASAP

Discovered at startup:5.0-1.0.0.0

2093746

Description:Devlink state dumps are not compatible with kernels prior to v5.3.
Workaround:N / D
Keywords:Devlink status report dump

Discovered at startup:5.0-1.0.0.0

2083427

Description:For kernels with connection tracing support, neighbor update events are not supported, which requires users to have static ARP to work with OVS and VxLAN.
Workaround:N / D
Keywords:VxLAN, VF LAG, Neighbor, ARP

Discovered at startup:5.0-1.0.0.0

2067012

Description:MLNX_EN cannot be installed on Debian 9.11 operating system in SwitchDev mode.
Workaround:Install OFED with flag--add-kernel-support.
Keywords:ASAP SwitchDev Debian-Kernel

Discovered at startup:5.0-1.0.0.0

2036572

Description:When using a thread domain and the non-blocking rdma-core ibv_post_send path, there is an additional CPU penalty due to the required barriers around the device's MMIO buffer, which are omitted in MLNX_EN.
Workaround:N / D
Keywords:rdma-core, write combo, MMIO buffer

Discovered at startup:5.0-1.0.0.0

Referee. Intern numbercompany
-Description:The argparse module is installed by default in Python versions =>2.7 and >=3.2. If an older version of Python is used, the argparse module is not installed by default.
Workaround:Install the argparse module manually.
Keywords:Python, MFT, argparse, installation
Discovered at startup: 4.7-3.2.9.0

1997230

Description:Running mlxfwreset or unloading the mlx5_core module while unloading contrak streams can cause kernel call traces.
Workaround:Stop the OVS service before calling mlxfwreset or unloading the mlx5_core module.
Keywords:Contrak, ASAP, OVS, mlxfwrest, Descargar
Discovered at startup: 4.7-3.2.9.0

1955352

description: It is not supported to switch 2 ports in parallel to SwitchDev mode.
alternative solution: N/T
Keywords:ASAP SwitchDev
Discovered at startup: 4.7-3.2.9.0

1979958

Description:IPv6 VxLAN offloading is not supported on CentOS/RHEL v7.2 operating systems.
Workaround:N / D
Keywords:Tunnel, VXLAN, ASAP, IPv6
Discovered at startup: 4.7-3.2.9.0

1991710

Description:Setting PRIO_TAG_REQUIRED_EN is not supported and may result in call tracking.
Workaround:N / D
Keywords:ASAP, PRIOR_TAG, mstconfig
Discovered at startup: 4.7-3.2.9.0

1967866

Description:Enabling ECMP offloading requires the VFs to be offline and the VMs to be powered off.
alternative solution: N/T
Keywords:ECMP, Multipath, ASAP2
Discovered at startup: 4.7-3.2.9.0

1821235

Description:When using the mlx5dv_dr API to create streams, the metadata C records are nulled for streams that perform the encapsulate or push vlan action.
Workaround:Use both actions at the end of the flow process.
Keywords:flow direction
Discovered at startup: 4.7-1.0.0.1

1921981

Description:On Ubuntu, Debian and RedHat 8 and higher OSS, parsing the mfa2 file with mstarchive may result in a segmentation error.
Workaround:Instead, use mlxarchive to parse the mfa2 file.
Keywords:MFT, mfa2, mstarchive, mlxarchive, Ubuntu, Debian, RedHat, OS
Discovered at startup: 4.7-1.0.0.1

1840288

Description:MLNX_EN does not support XDP functionality in the RedHat 7 operating system, despite RedHat's declared support.

Workaround:N / D
Keywords:XDP, RedHat
Discovered at startup: 4.7-1.0.0.1

1892663

Description:The mlnx_tune script is not compatible with the Python3 interpreter.
Workaround:Only run mlnx_tune with the python2 interpreter.
Keywords:mlnx_tune, python3, python2
Discovered at startup:4.7-1.0.0.1
Referee. Intern numbercompany

1753629

Description:A binding bug found in the 4.12 and 4.13 kernels can cause a slave to become permanently stuckBOND_LINK_FAILTo express. As a result, the following message may appear in the dmesg:

link: link status for interface eth1 failed, failed in 100ms

Workaround:N / D
Keywords:bond, slave
Discovered at startup: 4.6-1.0.1.1

1712068

Description:Uninstalling MLNX_EN will automatically uninstall several libraries included in MLNX_EN package such as: B. the InfiniBand related libraries.
Workaround:If you need these libraries, reinstall them using your local package manager (yum/dnf).
Keywords:MLNX_EN Libraries
Discovered at startup: 4.6-1.0.1.1
-Description:Due to library changes, MFT v4.11.0 and earlier are not compatible with MLNX_EN v4.6-1.0.0.0 and later.

Therefore, with MLNX_EN v4.6-1.0.0.0 and later, it is recommended to use MFT v4.12.0 and later.

Workaround:N / D
Keywords:Compatible with MFT
Discovered at startup: 4.6-1.0.1.1

1730840

Description:em ConnectX-4 HCA, GID index for RoCE v2 is inconsistent when switching between enabled and disabled interface mode.
Workaround:N / D
Keywords:RoCE v2, GID
Discovered at startup: 4.6-1.0.1.1

1717428

description: On kernel 4.10-4.14, MTUs greater than 1500 cannot be configured for a GRE interface with any drivers (IPv4 or IPv6).
alternative solution: Update your kernel to a version higher than v4.14.
Keywords:Fedora 27, gretap, ip_gre, ip_tunnel, ip6_gre, ip6_tunnel
Discovered at startup: 4.6-1.0.1.1

1748343

description: It takes a few minutes to charge the controller when there is a large amount of VF.
alternative solution: N/T
keywords: VF, SR-IOV
Discovered at startup: 4.6-1.0.1.1

1733974

description: Running heavy traffic (e.g. "Ping Flood") when enabling and disabling other mlx5 interfaces may result in "INFO: found rcu_preempt stuck on cpu/tasks:call traces.
alternative solution: N/T
keywords:mlx5
Discovered at startup: 4.6-1.0.1.1
-description: On HCAs ConnectX-6 and later, an attempt to set the advertisement (any bitmap) results in all resources being advertised.
Workaround:N / D
Keywords:200 GbE, Propaganda, Ettool
Discovered at startup: 4.6-1.0.1.1
Referee. Intern numbercompany

581631

description: GID entries referenced by a specific User Application cannot be removed while that User Application is running.

alternative solution: N/T

keywords: RoCE, GID

Discovered at startup: 4,5-1.0.1.0

1403313

description: Attempting to allocate an excessive amount of VF per PF on operating systems with kernel versions earlier than v4.15 may fail due to a known kernel issue.

alternative solution: Make sure to update your kernel version to v4.15 or later.

keywords: VF, PF, IOMMU, Núcleo, SO

Discovered at startup: 4,5-1.0.1.0

1521877

descriptionNote: On SLES 12 SP1 operating systems, a kernel trackpoint issue can cause undefined behavior when inserting a kernel module with an incorrect parameter.

alternative solution: N/T

keywords: mlx5-Controller, SLES 12 SP1

Discovered at startup: 4,5-1.0.1.0

Referee. Intern numbercompany

504073

descriptionNote: When using ConnectX-5 with LRO systems over PPC, the HCA may experience a back pressure due to delayed PCI writes. In this case, bandwidth can drop from line rate to ~35 Gb/s. Packet loss or dropped frames can also be observed.

alternative solution: Look for a PCI backpressure indicator ("outbound_pci_stalled_wr" counter in ethtools advance). Disabling LRO helps reduce back pressure and its effects.

keywords: Flusskontrolle, LRO

Discovered at startup: 4.4-1.0.0.0

1424233

description: On RHEL v7.3, 7.4 and 7.5 operating systems, configuring IPv4 IP forwarding will disable LRO on existing interfaces. Manually re-enabling LRO with ethtool and adding a VLAN interface can result in warning call tracking.

alternative solution- Make sure that IPv4 IP forwarding and LRO are not enabled at the same time.

keywords: IPv4 forwarding, LRO

Discovered at startup: 4.4-1.0.1.0

1442507

description: Retpoline support in GCC causes an increase in CPU usage, resulting in a 15% performance drop in IP forwarding.

alternative solution: N/T

keywords: Retpoline, GCC, CPU, IP forwarding, Specter attack

Discovered at startup: 4.4-1.0.1.0

1425129

description: MLNX_EN cannot be installed on SLES 15 operating systems using the Zypper repository.

alternative solution: Install MLNX_EN using the default installation script instead of the zypper repository.

keywords: Installation, SLES, Zypper

Discovered at startup: 4.4-1.0.1.0

1241056

description: When working with ConnectX-4/ConnectX-5 HCA on PPC systems with hardware LRO and Adaptive Rx support, bandwidth drops from Full Wire Rate (FWS) to ~60 Gb/s.

alternative solution: Be sure to disable Adaptive Rx when enabling Hardware LRO:ethtool -C <interfaz> adaptive-rx disabled

ethtool -C <interfaz> rx-usecs 8 rx-frames 128

keywords: Hardware-LRO, Adaptive Rx, PPC

Discovered at startup: 4.3-1.0.1.0

1090612

description: The NVMEoF protocol does not support LBA format with non-zero metadata length. Therefore, NVMe namespaces configured in LBA format with a metadata length greater than 0 result in Enhanced Error Handling (EEH) on PowerPC systems.

alternative solution: Configure the NVMe namespace to use the LBA format with zero-length metadata.

keywords: NVMEoF, PowerPC, EEH

Discovered at startup: 4.3-1.0.1.0

1309621

description: In the default switchdev mode configuration, stateless addresses/downloads based on internal headers are not supported.

alternative solution: To enable stateless direction/downloads based on inner headers, disable encap by running:

devlink dev eswitch show pci/0000:83:00.1 encap deshabilitar

Or, if the kernel doesn't support devlink, run:

echo nenhum > /sys/kernel/debug/mlx5/<BDF>/compat/encap

monitoring: This is a hardware limitation.

keywords: switchdev, stateless dump, address

Discovered at startup: 4.3-1.0.1.0

1275082

descriptionNote: When configuring a non-default IPv6 link-local address or an address that is not based on the MAC device, the connection establishment via RoCEv2 may fail.

alternative solution: N/T

keywords: IPV6, RoCE, Link-Local-Adresse

Discovered at startup: 4.3-1.0.1.0

1307336

description: In RoCE LAG mode, while runningibdev2netdev -v, the status of the second port of the mlx4_0 IB device is "NA" because this IB device does not have a second port.

alternative solution: N/T

keywords: mlx4, RoCE LAG, ibdev2netdev, enlace

Discovered at startup: 4.3-1.0.1.0

1296355

description: The total number of MSI-Xs that can be allocated to VF and PF is limited to 2300 on Power9 platforms.

alternative solution: N/T

keywords: MSI-X, VF, PF, PPC, SR-IOV

Discovered at startup: 4.3-1.0.1.0

1259293

description: On Fedora 20 operating systems, loading the driver fails with an error message like the following: "[185.262460] kmem_cache_sanity_check (fs_ftes_0000:00:06.0): cache name already exists.

This is because SLUB assignments bundle multiple kmem_cache_create boards into one board cache alias to save memory and increase cache performance. This causes the tile name to become obsolete.

alternative solution: Update the kernel version to kernel-3.19.8-100.fc20.x86_64.

Note that after rebooting you will need to rebuild with the new kernel
MLNX_EN against the new kernel version.

keywords: Fedora, load drivers

Discovered at startup: 4.3-1.0.1.0

1264359

description: Running perftest (ib_send_bw, ib_write_bw, etc.) in rdma-cm mode may increase the resp_cqe_error counter in /sys/class/infiniband/mlx5_0/ports/1/hw_counters/resp_cqe_error. This behavior is expected and is the result of receiving WQEs that have not been consumed.

alternative solution: N/T

keywords: most perfect, RDMA CM, mlx5

Discovered at startup: 4.3-1.0.1.0

1264956

descriptionNote: Configuring SR-IOV after disabling RoCE LAG via sysfs (/sys/bus/pci/drivers/mlx5_core/<bdf>/roce_lag_enable) may result in RoCE LAG being re-enabled when SR-IOV -Configuration fails.

alternative solution: Be sure to disable RoCE LAG again.

keywords: RoCE LAG, SR-IOV

Discovered at startup: 4.3-1.0.1.0

Referee. Intern numbercompany

1263043

description: On RHEL7.4, due to an OS issue introduced in kmod package version 20-15.el7_4.6, parsing of depmod configuration files fails, resulting in one of the following issues:

  • Resetting the controller fails and shows an error message like: "ERROR: module mlx5_core belongs to core and is not part of MLNX_EN, Ignore...
  • Malfunction of the nvmet_rdma kernel module despite installing MLNX_EN with the "--with-nvmf" option. An error message like: "nvmet_rdma: Unknown parameter "offload_mem_start" ignoredwill be featured in itdmesgSalida

alternative solution: Go toRedHat pageto update the kmod package version.

keywords: reset drivers, kmod, kmp, nvmf, nvmet_rdma

Discovered at startup: 4.2-1.2.0.0

-

description: Packet size limitation (actual MTU of the packet) for IPsec offload on Innova IPsec adapter cards: The current offload implementation does not support IP fragmentation. The size of the original packet must be such that it does not exceed the MTU size of the interface after ESP transformation (encryption of the original IP packet, which increases its length) and added headers (outer IP header):

  • Internal IP packet size <= I/F MTU - ESP adds (20) - External_IP (20) - Fragmentation problem Reserved length (56)
  • Interne IP-Paketgröße <= I/F MTU - 96

This mainly affects traffic forwarded with a lower MTU and UDP traffic. By default, TCP performs PMTU detection and sets the MSS accordingly.

alternative solution: N/T

keywords: Innova IPsec, MTU

Discovered at startup: 4.2-1.0.1.0

-

description: LLC/SNAP is not compatible with Innova IPsec adapter cards.

alternative solution: N/T

keywords: Innova IPsec, LLC/SNAP

Discovered at startup: 4.2-1.0.1.0

-

description: FEC is not supported on Innova IPsec adapter cards. When using switches, it may be necessary to change their settings.

alternative solution: N/T

keywords: Innova IPsec, FEC

Discovered at startup: 4.2-1.0.1.0

955929

description: Heavy traffic can cause SYN floods when using Innova IPsec adapter cards.

alternative solution: N/T

keywords: Innova IPsec, SYN-Flut

Discovered at startup: 4.2-1.0.1.0

-

description: Priority-based flow control is not supported on Innova IPsec adapter cards.

alternative solution: N/T

keywords: Innova IPsec, priority-based flow control

Discovered at startup: 4.2-1.0.1.0

-

description: Stop configuration is not supported when using Innova IPsec adapter cards. The default pause is the global pause (enabled).

alternative solution: N/T

keywords: Renew IPsec, global pause

Discovered at startup: 4.2-1.0.1.0

1045097

description: Plugging and unplugging a cable multiple times can cause connection errors when using Innova IPsec adapter cards.

alternative solution: N/T

keywords: Innova IPsec, cable, connection

Discovered at startup: 4.2-1.0.1.0

-

description: On Innova IPsec adapter cards, the supported MTU is between 512 and 2012 bytes. Setting MTU values ​​outside of this range may fail or result in traffic loss.

alternative solution: Set the MTU between 512 and 2012 bytes.

keywords: Innova IPsec, MTU

Discovered at startup: 4.2-1.0.1.0

1125184

description: On older kernel versions like Ubuntu 14.04 and RedHat 7.1, the VXLAN interface does not respond to ARP requests for an existing MAC address in its own ARP table. This issue has been fixed in the following latest kernel versions: Ubuntu 16.04 and RedHat 7.3.

alternative solution: N/T

keywords: ARP, VXLAN

Discovered at startup: 4.2-1.0.1.0

1134323

descriptionNote: Using kernel versions prior to 4.7 with IOMMU enabled may experience performance degradation and logical issues (e.g. temporary crashes) under heavy traffic loads. This is due to the fact that IOMMU-IOVA allocations are centralized, which requires many synchronization operations and high blocking overhead between CPUs.

alternative solution: Use kernel v4.7 or later or a supported kernel that includes the following patches:

  • 2aac630429d9 iommu/vt-d: altere intel-iommu para usar números de quadros IOVA
  • 9257b4a206fc iommu/iova: Introduce cache per CPU for iova allocation
  • 22e2f9fa63b0 iommu/vt-d: Use IOVA cache for CPU

keywords: IOMMU, bloqueo polite

Discovered at startup: 4.2-1.0.1.0

1135738

description: With a page size setting of 64 KB, the DMA memory may run out when trying to increase the ring size/number of channels.

alternative solution: Reduce ringtone size/number of channels.

keywords: DMA, 64K page

Discovered at startup: 4.2-1.0.1.0

1159650

description: When configuring VF VST, outgoing VLAN tagged packets are dropped in case of ConnectX-4 HCA. In the case of ConnectX-5 HCAs, an additional VLAN tag is inserted for outgoing packets with VLAN tags.

alternative solution: N/T

keywords: VST

Discovered at startup: 4.2-1.0.1.0

1157770

description: On relatively old passthrough/VM machines with QEMU and libvirtd,

CMD may time out after loading the driver.

After the timeout, no more commands are executed and all controller operations are blocked.

alternative solution: Update QEMU and libvirtd on the KVM server.

Tested with (Ubuntu 16.10) are the following versions:

  • libvirt 2.1.0
  • QEMU 2.6.1

keywords: QEMU

Discovered at startup: 4.2-1.0.1.0

1147703

description: Using dm-multipath for high availability over NVMEoF block devices must be done with the "directio" path verifier.

alternative solution: N/T

keywords: NVMEoF

Discovered at startup: 4.2-1.0.1.0

1152408

description: RedHat v7.3 PPCLE and v7.4 PPCLE operating systems do not support qemu KVM by default. Running it gives the following error messagevirt-installTo create new virtual machines:

I can't find the qemu-kvm package to install

alternative solution: Purchase the following beta RPMs from 7.4ALT to 7.3/7.4 PPCLE (in the same order):

  • qemu-img-.el7a.ppc64le.rpm
  • qemu-kvm-common-.el7a.ppc64le.rpm
  • qemu-kvm-.el7a.ppc64le.rpm

keywords: Virtualization, PPC, Power8, KVM, RedHat, PPC64LE

Discovered at startup: 4.2-1.0.1.0

1012719

description: At very high load of GSI QP (RDMA-CM applications) there may be a slight hang in the CQ polling stream. This is a temporary situation from which the driver will later recover.

alternative solution: N/T

keywords: RDMA-CM, GSI-QP, CQ

Discovered at startup: 4.2-1.0.1.0

1078630

descriptionNote: When working on RoCE LAG on Kernel v3.10, the kernel may fail to download the driver while Network Manager is running.

alternative solution: Please exit Network Manager before downloading the driver and restart it after the driver download is complete.

keywords: RoCE LAG, Administrator de red

Discovered at startup: 4.2-1.0.1.0

1149557

descriptionNote: When configuring VGT+, the maximum number of VLAN IDs allowed to appear in sysfs is 813 (up to the first 813).

alternative solution: N/T

keywords: VGT+

Discovered at startup: 4.2-1.0.1.0

Referee. Intern numbercompany

995665/1165919

description: In kernels prior to v4.13, the connection between the NVMEoF host and the target fails on a hyperthreaded system with more than 1 socket.

alternative solution: Connect to the NVMEoF subsystem on the host side--nr-io-queues <number_queues>Bandera.

look at thatnumber_queuesmust be less than or equal tonum_socketsmultiplied by num_cores_per_socket.

keywords: NVMEoF

1039346

description: Enabling multiple namespaces per subsystem when using NVMEoF target offload is not supported.

alternative solution: To enable more than one namespace, create a subsystem for each.

keywords: NVMEoF target swap, namespace

1030301

description: Creating virtual roles on a device in LAG mode destroys the LAG configuration. Device binding through the Ethernet NICs continues to work as expected.

alternative solution: N/T

keywords: LAG, SR-IOV

1047616

descriptionNote: If a device's node GUID is set to zero (0000:0000:0000:0000), the RDMA_CM userland application may crash.

alternative solution: Set the GUID of the node to a non-zero value.

keywords: RDMA_CM

1051701

descriptionNote: New versions of iproute that support new kernel features may work poorly on old kernels that do not support these new features.

alternative solution: N/T

keywords: iproute

1007830

description: If you are working on the Xenserver hypervisor with SR-IOV enabled, make sure the following statements apply:

  1. Immediately after enabling SR-IOV, unplug all virtual function controller instances from their PCI slots.
  2. It is not allowed to detach the instance from the PF controller while VFs are active.

alternative solution: N/T

keywords: SR-IOV

1005786

description: When using ConnectX-5 adapter cards, the following error may be reported to dmesg indicating a temporary lack of DMA pages:

"mlx5_core... give_pages:289:(pid x): The allocation time for page Y exceeded the maximum allowed time

mlx5_core ... page_notify_fail:263:(pid x): Page allocation failure notification sent to fw in func_id(z).

mlx5_core ... pages_work_handler:471:(pid x): dar falha -12”

example: This can happen when trying to open more than 64 VFs per port.

alternative solution: N/T

keywords: mlx5_core, DMA

1008066/1009004

Description: Performing some user-side operations during reboot may lead to call tracking/panic due to bugs found in the Linux kernel.

For example: Runget_vf_stats(via iptool) during reboot.

Problemumgehung: N/A

Keywords: mlx5_core, reboot

1009488

description: Mounting MLNX_EN in a path that contains special characters such as brackets or spaces is not supported. For example, if you mount MLNX_EN at /media/CDROM(vcd)/, the installation fails with the following error message:

# cd /medios/CDROM\(vcd\)/

# ./Install on PC

sh: 1: syntax error: "(" unexpected

alternative solution: N/T

keywords: built-in

982144

description- When Download Traffic Sniffer is enabled, your bandwidth can decrease by up to 50%.

alternative solution: N/T

keywords: Download traffic tracker

981362

description: Setting a TC number via the tc tool is not supported on many operating systems.

alternative solution: Set the TC number via the sysfs file /sys/class/net/<interface>/qos/tc_num.

keywords: Ethernet, CT

979457

description: Setting IOMMU=ON can cause severe performance degradation due to a bug in IOMMU.

alternative solution: Make sure the following patches are in your kernel:

  • iommu/vt-d: correct mapping of PASID table
  • iommu/vt-d: IOMMU search solution for SR-IOV virtual functions

monitoring: These patches are already available on Ubuntu 16.04.02 and 17.04 operating systems.

keywords: performance, IOMMU

References

Top Articles
Latest Posts
Article information

Author: Prof. An Powlowski

Last Updated: 17/08/2023

Views: 6080

Rating: 4.3 / 5 (44 voted)

Reviews: 83% of readers found this page helpful

Author information

Name: Prof. An Powlowski

Birthday: 1992-09-29

Address: Apt. 994 8891 Orval Hill, Brittnyburgh, AZ 41023-0398

Phone: +26417467956738

Job: District Marketing Strategist

Hobby: Embroidery, Bodybuilding, Motor sports, Amateur radio, Wood carving, Whittling, Air sports

Introduction: My name is Prof. An Powlowski, I am a charming, helpful, attractive, good, graceful, thoughtful, vast person who loves writing and wants to share my knowledge and understanding with you.