aboutgitcodebugslistschat
path: root/ndp.c
diff options
context:
space:
mode:
authorStefano Brivio <sbrivio@redhat.com>2021-07-26 07:18:50 +0200
committerStefano Brivio <sbrivio@redhat.com>2021-07-26 07:18:50 +0200
commit17765f8de0782de09ebdf79940f934b8ccb83c41 (patch)
tree11cc42c19a2b694b66dde7e377ba78e2107fd62a /ndp.c
parent0be49ccd93186600e40b8bffe867d18c4d16366a (diff)
downloadpasst-17765f8de0782de09ebdf79940f934b8ccb83c41.tar
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.tar.gz
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.tar.bz2
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.tar.lz
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.tar.xz
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.tar.zst
passt-17765f8de0782de09ebdf79940f934b8ccb83c41.zip
checksum: Introduce AVX2 implementation, unify helpers
Provide an AVX2-based function using compiler intrinsics for TCP/IP-style checksums. The load/unpack/add idea and implementation is largely based on code from BESS (the Berkeley Extensible Software Switch) licensed as 3-Clause BSD, with a number of modifications to further decrease pipeline stalls and to minimise cache pollution. This speeds up considerably data paths from sockets to tap interfaces, decreasing overhead for checksum computation, with 16-64KiB packet buffers, from approximately 11% to 7%. The rest is just syscalls at this point. While at it, provide convenience targets in the Makefile for avx2, avx2_debug, and debug targets -- these simply add target-specific CFLAGS to the build. Signed-off-by: Stefano Brivio <sbrivio@redhat.com>
Diffstat (limited to 'ndp.c')
-rw-r--r--ndp.c5
1 files changed, 3 insertions, 2 deletions
diff --git a/ndp.c b/ndp.c
index acc0473..b676825 100644
--- a/ndp.c
+++ b/ndp.c
@@ -27,6 +27,7 @@
#include <net/if.h>
#include <net/if_arp.h>
+#include "checksum.h"
#include "util.h"
#include "passt.h"
#include "tap.h"
@@ -172,8 +173,8 @@ int ndp(struct ctx *c, struct ethhdr *eh, size_t len)
ip6hr->payload_len = htons(sizeof(*ihr) + len);
ip6hr->hop_limit = IPPROTO_ICMPV6;
ihr->icmp6_cksum = 0;
- ihr->icmp6_cksum = csum_ip4(ip6hr, sizeof(*ip6hr) +
- sizeof(*ihr) + len);
+ ihr->icmp6_cksum = csum_unaligned(ip6hr, sizeof(*ip6hr) +
+ sizeof(*ihr) + len, 0);
ip6hr->version = 6;
ip6hr->nexthdr = IPPROTO_ICMPV6;