fix: Only try SNI slicing at offset 0 #2436

larseggert · 2025-02-11T09:42:53Z

Also check that the SNI length makes sense. This should fix the spurious failures for `compatible_upgrade_large_initial`.

codecov · 2025-02-11T09:50:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.27%. Comparing base (9734cf2) to head (bca7ada).
Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2436      +/-   ##
==========================================
- Coverage   95.28%   95.27%   -0.02%     
==========================================
  Files         114      115       +1     
  Lines       37111    37188      +77     
  Branches    37111    37188      +77     
==========================================
+ Hits        35363    35432      +69     
- Misses       1742     1750       +8     
  Partials        6        6


github-actions · 2025-02-11T10:30:37Z

Failed Interop Tests

QUIC Interop Runner, client vs. server, differences relative to db807a9.

neqo-latest as client

neqo-latest vs. aioquic: ⚠️Z
neqo-latest vs. go-x-net: BP BA
neqo-latest vs. haproxy: ⚠️BP BA
neqo-latest vs. kwik: Z 3 🚀C1 V2 BP BA
neqo-latest vs. lsquic: ⚠️L1 C1
neqo-latest vs. msquic: ⚠️Z A L1 L2 C1
neqo-latest vs. mvfst: A L1 C1 BP BA
neqo-latest vs. nginx: ⚠️BP BA
neqo-latest vs. ngtcp2: CM
neqo-latest vs. picoquic: A L1 🚀C1
neqo-latest vs. quic-go: A 🚀~~BP BA~~
neqo-latest vs. quiche: BP BA
neqo-latest vs. s2n-quic: ⚠️BP BA CM
neqo-latest vs. tquic: S BP BA
neqo-latest vs. xquic: ⚠️A

neqo-latest as server

aioquic vs. neqo-latest: ⚠️CM
go-x-net vs. neqo-latest: ⚠️CM
kwik vs. neqo-latest: ⚠️BP BA CM
lsquic vs. neqo-latest: ⚠️CM
msquic vs. neqo-latest: ⚠️Z U CM
mvfst vs. neqo-latest: Z A L1 C1 CM
openssl vs. neqo-latest: ⚠️LR M 6 CM
quic-go vs. neqo-latest: run cancelled after 20 min
quiche vs. neqo-latest: ⚠️CM
quinn vs. neqo-latest: ⚠️L1 V2 CM
s2n-quic vs. neqo-latest: ⚠️CM
tquic vs. neqo-latest: ⚠️CM
xquic vs. neqo-latest: ⚠️M CM

All results

Succeeded Interop Tests

QUIC Interop Runner, client vs. server

neqo-latest as client

neqo-latest vs. aioquic: 🚀~~H DC LR C20 M S R 3 B U A L1 L2 C1 C2 6 V2 BP BA~~
neqo-latest vs. go-x-net: H DC LR M B U A L2 C2 6
neqo-latest vs. haproxy: 🚀~~H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6 V2~~
neqo-latest vs. kwik: H DC LR C20 M S R B U A L1 L2 🚀C1 C2 6
neqo-latest vs. lsquic: 🚀~~H DC LR C20 M S R Z 3 B U E A L2 C2 6 V2 BP BA~~
neqo-latest vs. msquic: 🚀~~H DC LR C20 M S R B U C2 6 V2 BP BA~~
neqo-latest vs. mvfst: H DC LR M R Z 3 B U L2 C2 6
neqo-latest vs. neqo: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
neqo-latest vs. neqo-latest: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
neqo-latest vs. nginx: 🚀~~H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6~~
neqo-latest vs. ngtcp2: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA
neqo-latest vs. picoquic: H DC LR C20 M S R Z 3 B U E L2 🚀C1 C2 6 V2 BP BA
neqo-latest vs. quic-go: H DC LR C20 M S R Z 3 B U L1 L2 C1 C2 6 🚀~~BP BA~~
neqo-latest vs. quiche: H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6
neqo-latest vs. quinn: 🚀~~H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 BP BA~~
neqo-latest vs. s2n-quic: 🚀~~H DC LR C20 M S R 3 B U E A L1 L2 C1 C2 6~~
neqo-latest vs. tquic: H DC LR C20 M R Z 3 B U A L1 L2 C1 C2 6
neqo-latest vs. xquic: 🚀~~H DC LR C20 M R Z 3 B U L1 L2 C1 C2 6 BP BA~~

neqo-latest as server

aioquic vs. neqo-latest: 🚀~~H DC LR C20 M S R Z 3 B A L1 L2 C1 C2 6 V2 BP BA~~
chrome vs. neqo-latest: 3
go-x-net vs. neqo-latest: 🚀~~H DC LR M B U A L2 C2 6 BP BA~~
kwik vs. neqo-latest: 🚀~~H DC LR C20 M S R Z 3 B U A L1 L2 C1 C2 6 V2~~
lsquic vs. neqo-latest: 🚀~~H DC LR M S R 3 B E A L1 L2 C1 C2 6 V2 BP BA~~
msquic vs. neqo-latest: 🚀~~H DC LR C20 M S R B A L1 L2 C1 C2 6 V2 BP BA~~
mvfst vs. neqo-latest: H DC LR M 3 B L2 C2 6 BP BA
neqo vs. neqo-latest: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
ngtcp2 vs. neqo-latest: 🚀~~H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM~~
openssl vs. neqo-latest: 🚀~~H DC C20 S R 3 B A L2 C2 BP BA~~
picoquic vs. neqo-latest: H DC LR C20 M S R Z 3 B U E A L1 L2 C1 C2 6 V2 BP BA CM
quiche vs. neqo-latest: 🚀~~H DC LR M S R Z 3 B A L1 L2 C1 C2 6 BP BA~~
quinn vs. neqo-latest: H DC LR C20 M S R Z 3 B U E A ⚠️L1 L2 C1 C2 6 BP BA
s2n-quic vs. neqo-latest: 🚀~~H DC LR M S R 3 B E A L1 L2 C1 C2 6 BP BA~~
tquic vs. neqo-latest: 🚀~~H DC LR M S R Z 3 B A L1 L2 C1 C2 6 BP BA~~
xquic vs. neqo-latest: 🚀~~H DC LR C20 S R Z 3 B U A L1 L2 C1 C2 6 BP BA~~

Unsupported Interop Tests

QUIC Interop Runner, client vs. server

neqo-latest as client

neqo-latest vs. aioquic: E CM
neqo-latest vs. go-x-net: C20 S R Z 3 E L1 C1 V2 CM
neqo-latest vs. haproxy: E CM
neqo-latest vs. kwik: E CM
neqo-latest vs. lsquic: CM
neqo-latest vs. msquic: 3 E CM
neqo-latest vs. mvfst: C20 S E V2 CM
neqo-latest vs. nginx: E V2 CM
neqo-latest vs. picoquic: CM
neqo-latest vs. quic-go: E V2 CM
neqo-latest vs. quiche: E V2 CM
neqo-latest vs. quinn: V2 CM
neqo-latest vs. s2n-quic: Z V2
neqo-latest vs. tquic: E V2 CM
neqo-latest vs. xquic: S E V2 CM

neqo-latest as server

aioquic vs. neqo-latest: U E
chrome vs. neqo-latest: H DC LR C20 M S R Z B U E A L1 L2 C1 C2 6 V2 BP BA CM
go-x-net vs. neqo-latest: C20 S R Z 3 E L1 C1 V2
kwik vs. neqo-latest: E
lsquic vs. neqo-latest: C20 Z U
msquic vs. neqo-latest: 3 E
mvfst vs. neqo-latest: C20 S R U E V2
openssl vs. neqo-latest: Z U E L1 C1 V2
quiche vs. neqo-latest: C20 U E V2
s2n-quic vs. neqo-latest: C20 Z U V2
tquic vs. neqo-latest: C20 U E V2
xquic vs. neqo-latest: E V2

mxinden · 2025-02-11T13:34:45Z

neqo-transport/src/shuffle.rs

+        let len = buf.len();
+
+        assert!(buf[len - 23] == 0x00 && buf[len - 22] == 0x0c); // Check Server Name List length
+                                                                 // Set Server Name List length to 0


Bad formatting?

It's what rustfmt generates...

I'm more concerned about the constants you have littered around. 22 and 23 here, 15 and 39 above. Still, I can't see how that would be easy to fix.

github-actions · 2025-02-11T13:55:55Z

Benchmark results

Performance differences relative to d21c121.

decode 4096 bytes, mask ff: No change in performance detected.

       time:   [12.317 µs 12.356 µs 12.401 µs]
       change: [-0.4865% +0.0248% +0.5095%] (p = 0.93 > 0.05)
Found 13 outliers among 100 measurements (13.00%)

1 (1.00%) low severe

2 (2.00%) low mild

10 (10.00%) high severe

decode 1048576 bytes, mask ff: No change in performance detected.

       time:   [2.8369 ms 2.8500 ms 2.8668 ms]
       change: [-1.3138% -0.1710% +0.7399%] (p = 0.78 > 0.05)
Found 10 outliers among 100 measurements (10.00%)

1 (1.00%) low mild

9 (9.00%) high severe

decode 4096 bytes, mask 7f: No change in performance detected.

       time:   [20.842 µs 20.893 µs 20.950 µs]
       change: [-0.4852% -0.0816% +0.3468%] (p = 0.70 > 0.05)
Found 15 outliers among 100 measurements (15.00%)

2 (2.00%) low severe

2 (2.00%) low mild

11 (11.00%) high severe

decode 1048576 bytes, mask 7f: No change in performance detected.

       time:   [4.5388 ms 4.5499 ms 4.5625 ms]
       change: [-0.4903% -0.0912% +0.3116%] (p = 0.65 > 0.05)
Found 13 outliers among 100 measurements (13.00%)

1 (1.00%) low mild

12 (12.00%) high severe

decode 4096 bytes, mask 3f: No change in performance detected.

       time:   [8.2694 µs 8.3006 µs 8.3379 µs]
       change: [-1.2261% -0.5416% +0.1235%] (p = 0.13 > 0.05)
Found 12 outliers among 100 measurements (12.00%)

4 (4.00%) low mild

8 (8.00%) high severe

decode 1048576 bytes, mask 3f: No change in performance detected.

       time:   [1.5874 ms 1.5929 ms 1.5998 ms]
       change: [-0.5193% +0.0011% +0.6042%] (p = 0.98 > 0.05)
Found 8 outliers among 100 measurements (8.00%)

3 (3.00%) high mild

5 (5.00%) high severe

coalesce_acked_from_zero 1+1 entries: No change in performance detected.

       time:   [91.397 ns 91.739 ns 92.075 ns]
       change: [-1.0561% -0.1376% +0.6514%] (p = 0.78 > 0.05)
Found 14 outliers among 100 measurements (14.00%)

12 (12.00%) high mild

2 (2.00%) high severe

coalesce_acked_from_zero 3+1 entries: No change in performance detected.

       time:   [109.72 ns 110.03 ns 110.36 ns]
       change: [-0.4000% -0.0167% +0.3604%] (p = 0.94 > 0.05)
Found 13 outliers among 100 measurements (13.00%)

1 (1.00%) low mild

2 (2.00%) high mild

10 (10.00%) high severe

coalesce_acked_from_zero 10+1 entries: No change in performance detected.

       time:   [109.19 ns 109.46 ns 109.85 ns]
       change: [-1.1485% -0.2899% +0.4251%] (p = 0.52 > 0.05)
Found 14 outliers among 100 measurements (14.00%)

5 (5.00%) low severe

2 (2.00%) low mild

3 (3.00%) high mild

4 (4.00%) high severe

coalesce_acked_from_zero 1000+1 entries: No change in performance detected.

       time:   [93.519 ns 98.999 ns 110.82 ns]
       change: [-0.6119% +2.1434% +6.3312%] (p = 0.35 > 0.05)
Found 8 outliers among 100 measurements (8.00%)

1 (1.00%) high mild

7 (7.00%) high severe

RxStreamOrderer::inbound_frame(): No change in performance detected.

       time:   [112.27 ms 112.41 ms 112.65 ms]
       change: [-0.1823% +0.0176% +0.2537%] (p = 0.89 > 0.05)
Found 7 outliers among 100 measurements (7.00%)

4 (4.00%) low mild

2 (2.00%) high mild

1 (1.00%) high severe

SentPackets::take_ranges: No change in performance detected.

       time:   [5.2666 µs 5.4236 µs 5.5907 µs]
       change: [-1.4466% +1.5877% +4.8881%] (p = 0.32 > 0.05)
Found 8 outliers among 100 measurements (8.00%)

6 (6.00%) high mild

2 (2.00%) high severe

transfer/pacing-false/varying-seeds: Change within noise threshold.

       time:   [34.296 ms 34.360 ms 34.427 ms]
       change: [-0.7381% -0.4754% -0.2037%] (p = 0.00 < 0.05)
Found 1 outliers among 100 measurements (1.00%)

1 (1.00%) high severe

transfer/pacing-true/varying-seeds: Change within noise threshold.

       time:   [34.317 ms 34.371 ms 34.425 ms]
       change: [-1.0936% -0.8680% -0.6506%] (p = 0.00 < 0.05)
Found 3 outliers among 100 measurements (3.00%)

2 (2.00%) low mild

1 (1.00%) high mild

transfer/pacing-false/same-seed: Change within noise threshold.

       time:   [34.331 ms 34.387 ms 34.443 ms]
       change: [-0.6402% -0.4159% -0.2103%] (p = 0.00 < 0.05)
Found 1 outliers among 100 measurements (1.00%)

1 (1.00%) high mild

transfer/pacing-true/same-seed: No change in performance detected.

       time:   [34.772 ms 34.838 ms 34.918 ms]
       change: [-0.0637% +0.2185% +0.5077%] (p = 0.14 > 0.05)
Found 3 outliers among 100 measurements (3.00%)

1 (1.00%) low mild

1 (1.00%) high mild

1 (1.00%) high severe

1-conn/1-100mb-resp/mtu-1504 (aka. Download)/client: No change in performance detected.

       time:   [858.04 ms 868.21 ms 878.66 ms]
       thrpt:  [113.81 MiB/s 115.18 MiB/s 116.55 MiB/s]
change:
       time:   [-1.7169% -0.0437% +1.6344%] (p = 0.96 > 0.05)
       thrpt:  [-1.6081% +0.0437% +1.7469%]

1-conn/10_000-parallel-1b-resp/mtu-1504 (aka. RPS)/client: No change in performance detected.

       time:   [317.56 ms 320.89 ms 324.22 ms]
       thrpt:  [30.844 Kelem/s 31.164 Kelem/s 31.490 Kelem/s]
change:
       time:   [-1.0825% +0.4369% +1.9325%] (p = 0.57 > 0.05)
       thrpt:  [-1.8958% -0.4350% +1.0943%]

1-conn/1-1b-resp/mtu-1504 (aka. HPS)/client: Change within noise threshold.

       time:   [25.325 ms 25.467 ms 25.611 ms]
       thrpt:  [39.046  elem/s 39.266  elem/s 39.486  elem/s]
change:
       time:   [-2.0365% -1.1938% -0.2958%] (p = 0.01 < 0.05)
       thrpt:  [+0.2967% +1.2082% +2.0788%]

1-conn/1-100mb-resp/mtu-1504 (aka. Upload)/client: No change in performance detected.

       time:   [1.8331 s 1.8522 s 1.8714 s]
       thrpt:  [53.437 MiB/s 53.989 MiB/s 54.553 MiB/s]
change:
       time:   [-1.3182% +0.0496% +1.4620%] (p = 0.95 > 0.05)
       thrpt:  [-1.4410% -0.0496% +1.3358%]

Client/server transfer results

Performance differences relative to d21c121.

Transfer of 33554432 bytes over loopback, 30 runs. All unit-less numbers are in milliseconds.

Client	Server	CC	Pacing	Mean ± σ	Min	Max	Δ `main`	Δ `main` (%)
neqo	neqo	reno	on	520.2 ± 62.2	456.6	726.7	-14.3	-0.7%
neqo	neqo	reno		571.4 ± 173.4	448.6	1150.4	13.1	0.6%
neqo	neqo	cubic	on	536.3 ± 36.9	477.8	645.6	-5.9	-0.3%
neqo	neqo	cubic		534.7 ± 29.2	468.8	589.2	7.6	0.4%
google	neqo	reno	on	879.7 ± 95.0	625.4	975.5	-3.5	-0.1%
google	neqo	reno		885.4 ± 105.5	628.4	1070.3	-8.8	-0.2%
google	neqo	cubic	on	885.2 ± 99.9	630.0	1122.3	2.0	0.1%
google	neqo	cubic		880.4 ± 98.8	652.9	1086.0	3.0	0.1%
google	google			546.4 ± 35.6	519.8	680.6	1.1	0.1%
neqo	msquic	reno	on	232.9 ± 37.7	203.7	397.6	6.3	0.7%
neqo	msquic	reno		233.9 ± 44.9	203.2	398.6	9.7	1.1%
neqo	msquic	cubic	on	225.8 ± 30.9	200.6	372.4	-8.8	-1.0%
neqo	msquic	cubic		226.8 ± 32.6	200.3	364.3	-2.9	-0.3%
msquic	msquic			116.6 ± 17.6	100.3	171.8	4.2	0.9%

martinthomson · 2025-02-11T22:02:51Z

neqo-transport/src/shuffle.rs

+        let len = buf.len();
+
+        assert!(buf[len - 23] == 0x00 && buf[len - 22] == 0x0c); // Check Server Name List length
+                                                                 // Set Server Name List length to 0


I'm more concerned about the constants you have littered around. 22 and 23 here, 15 and 39 above. Still, I can't see how that would be easy to fix.

martinthomson · 2025-02-11T22:06:54Z

neqo-transport/src/crypto.rs

@@ -1575,7 +1575,7 @@ impl CryptoStreams {
        let Some((offset, data)) = cs.tx.next_bytes() else {
            return;
        };
-        let written = if sni_slicing {
+        let written = if sni_slicing && offset == 0 {


Good! Test?

I'm struggling a bit with how to structure that test. Got any suggestions?

So in our ordinary setup with splitting, we'll send out two packets. If we lose the right one of those (I think we reorder, so we'd want to lose the first), we'll have delivered the first part of the handshake data. Then, we'll be forced to retransmit, which means that the offset will be non-zero when we hit this code again.

To verify that we've not split, we can verify that we get just one frame, not two. Does that work?

It almost works, except for the verification bit, because that RTX'ed chunk of crypto frame will not look like a valid CH and so the slicing logic will always exit and leave one slice, even if it was called. We'd need to feed in something at a non-zero offset that is a valid-looking CH. Hm...

Right, so the test would not have failed prior to you adding this check.

Except that it would have, just not reliably. And I think that's OK. If the test won't fail without this check, I'm sure that cargo mutants will complain and we can dismiss that, but I'd rather have the test and not have it reliably able to catch the problem than not have anything.
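To make the retransmission scenario in this thread concrete, here is a minimal sketch of why the retransmitted chunk arrives at the guard with a non-zero offset. `ToyTx`, `lose_tail`, and `should_slice` are hypothetical names invented for illustration; they are not neqo's types, and only the `sni_slicing && offset == 0` condition is taken from the diff.

```rust
/// Hypothetical stand-in for a CRYPTO stream send buffer (not neqo's real type).
struct ToyTx {
    data: Vec<u8>,
    /// Byte ranges `(start, end)` that still need to be (re)transmitted.
    pending: Vec<(usize, usize)>,
}

impl ToyTx {
    fn new(data: Vec<u8>) -> Self {
        let end = data.len();
        Self { data, pending: vec![(0, end)] }
    }

    /// Next chunk to send as `(offset, bytes)`, mirroring the shape of the
    /// `next_bytes()` call in the diff (the real signature is an assumption).
    fn next_bytes(&self) -> Option<(u64, &[u8])> {
        let &(start, end) = self.pending.first()?;
        Some((start as u64, &self.data[start..end]))
    }

    /// Model the loss scenario: everything was sent once, the bytes before
    /// `from` were delivered, and the bytes from `from` onward were lost.
    fn lose_tail(&mut self, from: usize) {
        self.pending = vec![(from, self.data.len())];
    }
}

/// The guard from the diff: only attempt SNI slicing on the very first bytes.
fn should_slice(sni_slicing: bool, offset: u64) -> bool {
    sni_slicing && offset == 0
}
```

In this toy model, the first call to `next_bytes()` yields offset 0 and passes the guard; after `lose_tail(60)` the next chunk starts at offset 60, so `should_slice(true, 60)` is false and the slicing logic is never consulted for the retransmission.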

Thinking more on this, the odds of hitting this issue are probably higher than you would think. We look for a more or less fixed ClientHello shape. There are a few places where we skip vectors, but there we only need to ensure that we don't overrun the buffer, so pure noise will pass those tests. The real check is where we hit the SNI extension parsing. If we have something that is [0, 0, x, y] where x*256+y is between 3 and the remaining length of the packet, we will treat that as SNI. (We do not check the SNI type or inner length, so this is looser than you might think.) We also get a few swipes at that, jumping forward every time we see [?, ?, x, y].

So while there's a good chance that we'll skip past the end of the buffer and believe there is no SNI, the odds of us hitting this sequence are not as low as I had thought. They're not as high as I'd like, but with a padding extension still in play, they're going to be non-trivial.
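The loose check described above can be sketched as follows. This is an illustration of the heuristic as stated in the comment, not the code in `neqo-transport/src/shuffle.rs`: `find_sni_like` is a hypothetical name, and it assumes the input is already positioned at the start of the extensions, each laid out as a two-byte type, a two-byte big-endian length, and a body.

```rust
use std::ops::Range;

/// Walk `[type_hi, type_lo, len_hi, len_lo, body...]` records and return the
/// body range of the first one that "looks like" SNI: type bytes `[0, 0]` and
/// a length between 3 and the remaining buffer length. As in the comment
/// above, neither the SNI name type nor the inner length is checked.
fn find_sni_like(ext: &[u8]) -> Option<Range<usize>> {
    let mut pos = 0;
    while pos + 4 <= ext.len() {
        let ext_type = u16::from_be_bytes([ext[pos], ext[pos + 1]]);
        let ext_len = usize::from(u16::from_be_bytes([ext[pos + 2], ext[pos + 3]]));
        let body = pos + 4;
        let remaining = ext.len() - body;
        if ext_len > remaining {
            // The claimed length runs past the end of the buffer: no SNI.
            return None;
        }
        if ext_type == 0 && ext_len >= 3 {
            return Some(body..body + ext_len);
        }
        // Not SNI-shaped: jump forward by the claimed length and try again.
        pos = body + ext_len;
    }
    None
}
```

Under these assumptions, arbitrary bytes such as `[0x12, 0x34, 0, 2, 0xaa, 0xbb, 0, 0, 0, 4, 9, 9, 9, 9]` are accepted as containing an SNI at `10..14`, which is the kind of false positive that a retransmitted chunk at a non-zero offset could trigger.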

Co-authored-by: Martin Thomson <[email protected]> Signed-off-by: Lars Eggert <[email protected]>

martinthomson · 2025-02-13T02:42:07Z

neqo-transport/src/crypto.rs

-                )
+        while let Some((offset, data)) = cs.tx.next_bytes() {
+            let written = if sni_slicing && offset == 0 {
+                qdebug!("XXX SNI slicing enabled");


martinthomson · 2025-02-13T02:42:35Z

neqo-transport/src/shuffle.rs

@@ -145,4 +147,19 @@ mod tests {
        let buf = [1; 1];
        assert!(super::find_sni(&buf).is_none());
    }
+
+    #[test]


I'd stick some commentary in here to cover this better.

fix: Only try SNI slicing at offset 0

a6a5a76

And also check that the SNI length makes sense. This should fix the spurious failures for `compatible_upgrade_large_initial`.

larseggert requested review from KershawChang, martinthomson and mxinden as code owners February 11, 2025 09:42

Add test

bdbbb64

Merge branch 'main' into fix-sni-slicing

5e3d8a0

mxinden reviewed Feb 11, 2025

martinthomson reviewed Feb 11, 2025

larseggert and others added 5 commits February 12, 2025 08:37

Update neqo-transport/src/shuffle.rs

11f7082

Co-authored-by: Martin Thomson <[email protected]> Signed-off-by: Lars Eggert <[email protected]>

Update neqo-transport/src/shuffle.rs

3d2a780

Co-authored-by: Martin Thomson <[email protected]> Signed-off-by: Lars Eggert <[email protected]>

Update neqo-transport/src/shuffle.rs

5e3c8ed

Co-authored-by: Martin Thomson <[email protected]> Signed-off-by: Lars Eggert <[email protected]>

Suggestions from @martinthomson

45a7a54

Add test

bca7ada

martinthomson approved these changes Feb 13, 2025
