Migrate to encoding/json/v2 #292

inteon · 2025-06-16T06:54:00Z

Replaces the github.com/json-iterator/go dependency with encoding/json/v2.
Performance is not yet great (feel free to push improvements or create new PRs based on this PR):

# based on 8273db415281d117376643df2325c1fff36a8c41
$ go test -benchmem -run=^$ -bench ^BenchmarkFieldSet/serialize.*$ sigs.k8s.io/structured-merge-diff/v6/fieldpath -count=6 > pr.txt
$ go test -benchmem -run=^$ -bench ^BenchmarkFieldSet/serialize.*$ sigs.k8s.io/structured-merge-diff/v6/fieldpath -count=6 > master.txt
$ benchstat master.txt pr.txt

goos: linux
goarch: amd64
pkg: sigs.k8s.io/structured-merge-diff/v6/fieldpath
cpu: Intel(R) Core(TM) Ultra 7 165H
                            │  master.txt  │               pr.txt               │
                            │    sec/op    │   sec/op     vs base               │
FieldSet/serialize-20-8       5.949µ ±  8%   6.495µ ± 1%   +9.19% (p=0.002 n=6)
FieldSet/deserialize-20-8     18.02µ ±  7%   14.51µ ± 2%  -19.47% (p=0.002 n=6)
FieldSet/serialize-50-8       19.97µ ±  7%   18.68µ ± 7%   -6.47% (p=0.015 n=6)
FieldSet/deserialize-50-8     42.15µ ±  8%   40.97µ ± 3%        ~ (p=0.818 n=6)
FieldSet/serialize-100-8      73.39µ ±  6%   66.18µ ± 7%   -9.83% (p=0.002 n=6)
FieldSet/deserialize-100-8    127.2µ ±  4%   133.7µ ± 6%   +5.09% (p=0.002 n=6)
FieldSet/serialize-500-8      401.3µ ±  7%   357.4µ ± 5%  -10.94% (p=0.002 n=6)
FieldSet/deserialize-500-8    668.2µ ±  6%   681.2µ ± 5%        ~ (p=0.818 n=6)
FieldSet/serialize-1000-8     855.9µ ± 10%   810.0µ ± 4%   -5.35% (p=0.026 n=6)
FieldSet/deserialize-1000-8   1.533m ±  4%   1.494m ± 9%        ~ (p=0.394 n=6)
geomean                       111.5µ         106.5µ        -4.45%

                            │  master.txt   │               pr.txt                │
                            │     B/op      │     B/op      vs base               │
FieldSet/serialize-20-8         2350.0 ± 0%     517.0 ± 0%  -78.00% (p=0.002 n=6)
FieldSet/deserialize-20-8     11.278Ki ± 0%   5.840Ki ± 0%  -48.21% (p=0.002 n=6)
FieldSet/serialize-50-8        6.375Ki ± 0%   1.407Ki ± 0%  -77.93% (p=0.002 n=6)
FieldSet/deserialize-50-8      24.30Ki ± 0%   16.41Ki ± 0%  -32.45% (p=0.002 n=6)
FieldSet/serialize-100-8      20.426Ki ± 0%   4.806Ki ± 0%  -76.47% (p=0.002 n=6)
FieldSet/deserialize-100-8     74.18Ki ± 0%   56.74Ki ± 0%  -23.52% (p=0.002 n=6)
FieldSet/serialize-500-8      112.01Ki ± 1%   24.04Ki ± 0%  -78.54% (p=0.002 n=6)
FieldSet/deserialize-500-8     360.9Ki ± 0%   276.2Ki ± 0%  -23.46% (p=0.002 n=6)
FieldSet/serialize-1000-8     226.75Ki ± 1%   54.03Ki ± 0%  -76.17% (p=0.002 n=6)
FieldSet/deserialize-1000-8    788.7Ki ± 0%   613.0Ki ± 0%  -22.28% (p=0.002 n=6)
geomean                        46.16Ki        18.24Ki       -60.48%

                            │  master.txt  │               pr.txt               │
                            │  allocs/op   │  allocs/op   vs base               │
FieldSet/serialize-20-8         9.000 ± 0%    1.000 ± 0%  -88.89% (p=0.002 n=6)
FieldSet/deserialize-20-8       285.0 ± 0%    206.0 ± 0%  -27.72% (p=0.002 n=6)
FieldSet/serialize-50-8        14.000 ± 0%    1.000 ± 0%  -92.86% (p=0.002 n=6)
FieldSet/deserialize-50-8       832.0 ± 0%    590.0 ± 0%  -29.09% (p=0.002 n=6)
FieldSet/serialize-100-8       32.000 ± 0%    1.000 ± 0%  -96.88% (p=0.002 n=6)
FieldSet/deserialize-100-8     2.784k ± 0%   2.068k ± 0%  -25.72% (p=0.002 n=6)
FieldSet/serialize-500-8      143.000 ± 1%    1.000 ± 0%  -99.30% (p=0.002 n=6)
FieldSet/deserialize-500-8     14.27k ± 0%   10.34k ± 0%  -27.52% (p=0.002 n=6)
FieldSet/serialize-1000-8     307.500 ± 0%    1.000 ± 0%  -99.67% (p=0.002 n=6)
FieldSet/deserialize-1000-8    31.54k ± 0%   22.57k ± 0%  -28.44% (p=0.002 n=6)
geomean                         373.4         47.52       -87.27%

closes #202

dims · 2025-06-17T01:59:33Z

xref: kubernetes/kubernetes#132312

dims · 2025-06-18T00:45:38Z

For the pull-structured-merge-diff-test failure, please add this to fix the error @inteon

diff --git a/internal/cli/main_test.go b/internal/cli/main_test.go
index 3e409ede0673..8beed5c6cf55 100644
--- a/internal/cli/main_test.go
+++ b/internal/cli/main_test.go
@@ -21,6 +21,7 @@ import (
        "encoding/json"
        "io/ioutil"
        "path/filepath"
+       "strings"
        "testing"
 )

@@ -135,7 +136,7 @@ func (tt *testCase) checkOutput(t *testing.T, got []byte) {
                t.Fatalf("couldn't read expected output %q: %v", tt.expectedOutputPath, err)
        }

-       if a, e := string(got), string(want); a != e {
+       if a, e := strings.TrimSpace(string(got)), strings.TrimSpace(string(want)); a != e {
                t.Errorf("output didn't match expected output: got:\n%v\nwanted:\n%v\n", a, e)
        }
 }

inteon · 2025-06-18T10:15:31Z

> For the pull-structured-merge-diff-test failure, please add this to fix the error @inteon
> ...

I fixed the test failure.

dims · 2025-06-18T10:30:41Z

/assign @BenTheElder @liggitt

liggitt · 2025-06-19T16:59:30Z

/assign @jpbetz
who is the primary apimachinery approver on this bit and was deeply involved in the initial performance-driven use of json-iterator in these bits

liggitt · 2025-06-19T17:01:50Z

> For the pull-structured-merge-diff-test failure, please add this to fix the error @inteon

I suspect using a json marshal function (like MarshalWrite) that doesn't append a newline would be a more efficient way to accomplish that
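
For context on the trailing-newline mismatch, here is a minimal sketch using only the v1 standard library: `json.Encoder.Encode` terminates each value with a newline, while `json.Marshal` does not. This is not the PR's code, the helper names are made up for illustration, and it does not verify how `MarshalWrite` in json/v2 behaves.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
)

// encodeWithEncoder serializes v through json.Encoder, which appends '\n'
// after the encoded value.
func encodeWithEncoder(v any) ([]byte, error) {
	var buf bytes.Buffer
	if err := json.NewEncoder(&buf).Encode(v); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

// encodeWithMarshal serializes v through json.Marshal, which adds no newline.
func encodeWithMarshal(v any) ([]byte, error) {
	return json.Marshal(v)
}

func main() {
	v := map[string]int{"a": 1}
	enc, _ := encodeWithEncoder(v)
	mar, _ := encodeWithMarshal(v)
	// %q makes the trailing newline visible in the first line of output.
	fmt.Printf("%q\n%q\n", enc, mar)
}
```

Choosing an encoding call that never emits the newline avoids the need to trim whitespace when comparing against golden files.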

liggitt · 2025-06-19T17:02:46Z

fieldpath/serialize-pe.go

+			return nil, fmt.Errorf("parsing JSON: %v", err)
+		}
+
+		k := rawKey.String()


is rawKey.String() the same as decoding to a string, in terms of interpreting escape sequences, etc?
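
To make the concern concrete, the sketch below contrasts the raw text of a JSON string literal with its decoded value. It deliberately uses v1 `encoding/json` and `json.RawMessage` as a stand-in; it says nothing about what `rawKey.String()` returns in the PR, and `rawAndDecoded` is a hypothetical helper.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// rawAndDecoded returns the raw JSON text of a string literal and the Go
// string obtained by actually decoding it (escape sequences interpreted).
func rawAndDecoded(literal string) (string, string, error) {
	var rm json.RawMessage
	if err := json.Unmarshal([]byte(literal), &rm); err != nil {
		return "", "", err
	}
	var decoded string
	if err := json.Unmarshal(rm, &decoded); err != nil {
		return "", "", err
	}
	return string(rm), decoded, nil
}

func main() {
	raw, decoded, err := rawAndDecoded(`"a\u0062\n\"c"`)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("raw text:      %s\n", raw)
	fmt.Printf("decoded value: %q\n", decoded)
}
```

If a key is taken from the raw text, two inputs that decode to the same string (for example `"b"` and `"\u0062"`) would be treated as different keys.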

liggitt · 2025-06-19T17:07:25Z

value/reflectcache_test.go

-		{
-			JSON:     `1.0`,
-			IntoType: reflect.TypeOf(json.Number("")),
-			Want:     json.Number("1.0"),
-		},
-		{
-			JSON:     `1`,
-			IntoType: reflect.TypeOf(json.Number("")),
-			Want:     json.Number("1"),
-		},


curious if it's ok to drop these... were they added to try to catch a specific issue?

k8s-ci-robot · 2025-06-20T19:05:14Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: inteon
Once this PR has been reviewed and has the lgtm label, please ask for approval from jpbetz. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

liggitt · 2025-06-23T15:17:41Z

The .../deserialize... benchmarks actually don't look terrible now... I'd be willing to accept that performance drop in pursuit of correctness / safety.

The serialize benchmarks still look pretty rough. Need to see what we can improve there.

liggitt · 2025-06-24T12:59:14Z

did you run the full set of benchmarks to see how we looked across all of them?

liggitt · 2025-07-01T17:33:24Z

Thanks for the updates, how are the overall benchmarks looking (not just the subset in the description)?

As you were adjusting the implementation, were there any unit tests it would make sense to add to catch edge cases that the previous implementation handled and that we want to ensure the new one handles as well? I'm thinking specifically of things like:

- handling of extra data in the input bytes/buffer when decoding/deserializing (e.g. "somevalue"extrastuff or {"key":"value"}extrastuff)
- handling of special characters in strings that need escaping where the raw bytes would not be the same as the decoded or encoded/escaped bytes
- handling of ignorable whitespace when decoding
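
The cases listed above can be probed with a few lines of Go. This is only a sketch against v1 `encoding/json` (where `json.Unmarshal` rejects trailing data); `decodeStrict` is a hypothetical helper, and the behaviour of the PR's json/v2-based decoder would need its own tests.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// decodeStrict decodes input as exactly one JSON value. json.Unmarshal
// validates the whole input, so trailing non-whitespace data is an error.
func decodeStrict(input string) (any, error) {
	var v any
	err := json.Unmarshal([]byte(input), &v)
	return v, err
}

func main() {
	inputs := []string{
		`"somevalue"extrastuff`,        // trailing data after a string
		`{"key":"value"}extrastuff`,    // trailing data after an object
		" \t{ \"key\" :\n\"value\" } ", // insignificant whitespace only
		`"tab\there"`,                  // escape sequence inside a string
	}
	for _, in := range inputs {
		v, err := decodeStrict(in)
		fmt.Printf("%q -> value=%#v err=%v\n", in, v, err)
	}
}
```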

jpbetz · 2025-07-01T18:22:27Z

First off, it's amazing to see this happening and the benchmarks are VERY promising. Thanks @inteon!

To get this to the finish line and merged, what should our criteria be?

I chatted offline with @liggitt briefly, and some of the criteria we discussed were:

- Golang releases a stable json/v2 (The alternative would be to add an internal copy of json-experiment to this repo like kube-openapi has, but I don't know if it's worth it given how close json/v2 is to stable)
- github.com/kubernetes/kubernetes CI test stability is not negatively impacted
- This passes a scale test (SIG instrumentation)
- We are confident in the correctness (triple-check the implementation, shore up with additional functional tests)

Intuitively, it seems like the deserialization is already sufficiently fast. I suspect we need to optimize serialization a bit further since we serialize managed fields on all updates (not just patches). That said, I'm willing to be data driven here. If we can show downstream scale and performance is acceptable, I'm willing to accept a higher serialization perf regression in order to migrate to json/v2.

Thoughts, concerns?

inteon · 2025-08-30T12:02:04Z

Update: check the new numbers in my PR description; upgrading encoding/json/v2 did improve performance!

inteon · 2025-08-30T15:09:29Z

Did some further tuning and got the # allocations lower than on master.
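
As an illustration of how an allocs/op figure like the ones in the description can be spot-checked outside a benchmark, here is a sketch using `testing.AllocsPerRun`. It is not the PR's code: the two functions are hypothetical, and appending into a reused buffer is just one common way to cut allocations, not necessarily the tuning applied here.

```go
package main

import (
	"fmt"
	"strconv"
	"testing"
)

// sink keeps results reachable so the compiler cannot elide the allocation.
var sink string

// allocsNaive measures a version that builds a fresh string on every call.
func allocsNaive(key string) float64 {
	return testing.AllocsPerRun(100, func() {
		sink = strconv.Quote(key)
	})
}

// allocsReused measures a version that appends into one pre-sized buffer,
// resetting its length (but keeping its capacity) on every call.
func allocsReused(key string) float64 {
	buf := make([]byte, 0, 64)
	return testing.AllocsPerRun(100, func() {
		buf = strconv.AppendQuote(buf[:0], key)
	})
}

func main() {
	fmt.Println("naive allocs/op: ", allocsNaive("fieldName"))
	fmt.Println("reused allocs/op:", allocsReused("fieldName"))
}
```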

BenTheElder · 2025-08-30T21:19:29Z

> Golang releases a stable json/v2 (The alternative would be to add an internal copy of json-experiment to this repo like kube-openapi has, but I don't know if it's worth it given how close json/v2 is to stable)

even with stable json/v2, we might need to temporarily use a fork; otherwise kubernetes branches that aren't on that go version yet can't update SMD.

but we should encapsulate it and plan to eliminate it when we're ready to require that minimum go version

even kubernetes master isn't on 1.25 yet
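
One way to read "encapsulate it" is a single seam that the rest of the module calls, so the backing JSON library can later be swapped in one place. The sketch below is an assumption about what that could look like; the `Codec` type and `roundTrip` helper are hypothetical, and v1 `encoding/json` stands in for the fork.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Codec is a hypothetical seam: callers depend only on these two functions,
// not on the package that implements them.
type Codec struct {
	Marshal   func(v any) ([]byte, error)
	Unmarshal func(data []byte, v any) error
}

// defaultCodec is backed by the v1 standard library purely for illustration;
// a temporary fork or encoding/json/v2 would be wired in the same way.
var defaultCodec = Codec{Marshal: json.Marshal, Unmarshal: json.Unmarshal}

// roundTrip encodes and decodes a value through the given codec.
func roundTrip(c Codec, in map[string]string) (map[string]string, error) {
	data, err := c.Marshal(in)
	if err != nil {
		return nil, err
	}
	out := map[string]string{}
	if err := c.Unmarshal(data, &out); err != nil {
		return nil, err
	}
	return out, nil
}

func main() {
	out, err := roundTrip(defaultCodec, map[string]string{"f:metadata": "{}"})
	fmt.Println(out, err)
}
```

Removing the fork later then means changing only the two function values in `defaultCodec`.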

liggitt · 2025-09-02T19:30:48Z

> Did some further tuning and got the # allocations lower than on master.

Am I reading correctly that B/op and allocs/op are ~equivalent or better than master on pretty much all benchmarks? If so, that's amazing progress!

Paired with a close review and functional/correctness test coverage to make sure the new approach behaves identically to the old version (especially in terms of what it accepts/rejects/produces in edge cases like leading/trailing/non-normalized/invalid inputs), this looks really promising.

lalitc375 · 2025-09-04T22:25:18Z

Amazing work @inteon in reducing the number of allocs per operation to 1. I did a similar analysis over your change, and saw similar performance. The change should have zero to negligible impact on Kube API server performance. We just have to make sure this new library behaves the same as the existing implementation functionally, which I think existing tests should be able to do(?).

liggitt · 2025-09-04T23:02:44Z

> We just have to make sure this new library behaves the same as the existing implementation functionally, which I think existing tests should be able to do(?).

I'm not sure how detailed the existing tests are at all the edge cases of valid and invalid variants on input (handling of escaped values in keys, whitespace before/after/between tokens, valid and invalid syntax, etc), and byte-for-byte assertions about output. Since this needed to effectively rewrite some of the encoding/decoding paths, we need to make sure we have test coverage for those things.

lalitc375 · 2025-09-09T18:08:19Z

> > We just have to make sure this new library behaves the same as the existing implementation functionally, which I think existing tests should be able to do(?).
>
> I'm not sure how detailed the existing tests are at all the edge cases of valid and invalid variants on input (handling of escaped values in keys, whitespace before/after/between tokens, valid and invalid syntax, etc), and byte-for-byte assertions about output. Since this needed to effectively rewrite some of the encoding/decoding paths, we need to make sure we have test coverage for those things.

There are not enough tests for Unicode and escape characters. I have added those tests in #300. With these new tests, we should detect regressions in the serialization and deserialization code in the future.

liggitt · 2025-09-09T18:16:35Z

Excellent, #300 looks like a great step forward for test coverage of normalized encoding. We'll probably want similar additions for:

- decoding of valid-but-non-normalized values working properly (insignificant leading / trailing / interspersed whitespace, or non-canonical escaped values, etc) and capturing what in-memory values are produced by the decoding
- decoding of invalid values erroring properly
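
A table-driven shape fits both kinds of coverage: each case pairs an input with either the expected in-memory value or an expected error. The sketch below runs against v1 `encoding/json` only to show the structure; the case list and the `check` helper are illustrative assumptions, and the real tests would target the fieldpath and value decoders.

```go
package main

import (
	"encoding/json"
	"fmt"
)

type decodeCase struct {
	name    string
	input   string
	want    string // expected decoded value when wantErr is false
	wantErr bool
}

var decodeCases = []decodeCase{
	{name: "canonical", input: `"a/b"`, want: "a/b"},
	{name: "escaped solidus", input: `"a\/b"`, want: "a/b"},
	{name: "unicode escape for ASCII", input: `"\u0061/b"`, want: "a/b"},
	{name: "surrounding whitespace", input: " \n\t\"a/b\" \r\n", want: "a/b"},
	{name: "invalid escape", input: `"a\xb"`, wantErr: true},
	{name: "unterminated string", input: `"a/b`, wantErr: true},
	{name: "single quotes", input: `'a/b'`, wantErr: true},
}

// check runs one case and returns a description of the mismatch, or "".
func check(c decodeCase) string {
	var got string
	err := json.Unmarshal([]byte(c.input), &got)
	switch {
	case c.wantErr && err == nil:
		return fmt.Sprintf("%s: expected an error, decoded %q", c.name, got)
	case !c.wantErr && err != nil:
		return fmt.Sprintf("%s: unexpected error: %v", c.name, err)
	case !c.wantErr && got != c.want:
		return fmt.Sprintf("%s: got %q, want %q", c.name, got, c.want)
	}
	return ""
}

func main() {
	for _, c := range decodeCases {
		if msg := check(c); msg != "" {
			fmt.Println("FAIL", msg)
			continue
		}
		fmt.Println("ok  ", c.name)
	}
}
```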

inteon · 2025-09-11T10:00:57Z

After upgrading github.com/go-json-experiment/json and rerunning the benchmarks, all benchmarks now outperform the benchmarks on master.

liggitt · 2025-09-11T11:44:02Z

Huh… did something change on s-m-d master? The latest benchmark update looks like some of the relative improvement came from master getting worse...

liggitt · 2025-09-11T12:30:14Z

oh, maybe the test changes in #300 impacted the master benchmark numbers

dims · 2025-09-17T21:11:42Z

k/k master is at golang v1.25.1 fyi

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Jun 16, 2025

k8s-ci-robot requested review from apelisse and Jefftree June 16, 2025 06:54

k8s-ci-robot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Jun 16, 2025

inteon force-pushed the use_json_v2 branch 2 times, most recently from a4b6871 to bdce391 Compare June 18, 2025 10:14

k8s-ci-robot assigned BenTheElder and liggitt Jun 18, 2025

k8s-ci-robot assigned jpbetz Jun 19, 2025

liggitt reviewed Jun 19, 2025

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 28, 2025

inteon force-pushed the use_json_v2 branch from 5b8c555 to 373f016 Compare June 30, 2025 08:19

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 30, 2025

inteon force-pushed the use_json_v2 branch from 373f016 to f90a164 Compare June 30, 2025 08:31

liggitt moved this to Dependencies to replace/remove in [sig-architecture] Dependency management Jul 2, 2025

liggitt added this to [sig-architecture] Dependency management Jul 2, 2025

inteon added 3 commits August 19, 2025 12:50

migrate github.com/json-iterator/go to encoding/json/v2

c288d82

use MarshalJSONTo function

99ecf29

Signed-off-by: Tim Ramlot <[email protected]>

use UnmarshalJSONFrom function

03d95ec

Signed-off-by: Tim Ramlot <[email protected]>

inteon added 6 commits August 19, 2025 12:50

peformance tuning

7290eb6

Signed-off-by: Tim Ramlot <[email protected]>

introduce MarshalValue

7580c7e

Signed-off-by: Tim Ramlot <[email protected]>

upgrade github.com/go-json-experiment/json

3f1e73c

Signed-off-by: Tim Ramlot <[email protected]>

reduce serialize allocations

b2387ea

Signed-off-by: Tim Ramlot <[email protected]>

improve sorting

27e265b

Signed-off-by: Tim Ramlot <[email protected]>

cleanup serialisation

ef02f1b

Signed-off-by: Tim Ramlot <[email protected]>

inteon force-pushed the use_json_v2 branch from f90a164 to 3f1e73c Compare August 30, 2025 12:00

upgrade github.com/go-json-experiment/json

8273db4

Signed-off-by: Tim Ramlot <[email protected]>

dims mentioned this pull request Sep 9, 2025

[WIP] Switch to building golang outselves kubernetes/release#4111

Closed

lalitc375 mentioned this pull request Sep 16, 2025

Add more test coverage for searialization and deserialization #303

Open
