refactor: use coder/slog + minor go style changes #107

cstyan · 2025-05-16T19:16:01Z

Changes are broken down in to multiples commits to hopefully make reviewing easy. 1 commit for the slog change and then a commit per Go file for style changes.

Style changes are generally:

try to use full sentences for all comments
try to stick to 120 column lines (not strict) instead of 80
try to one line as many call function, check if err != nil blocks as possible
stick var and const definitions near the top of the file
try to use err or errs for all return type names, previously used problems in some cases but errs in others
some minor optimizations, like the line scanner declaring a new variable in each iteration of a loop
Todo -> TODO, sometimes also useful to do TODO (name): to make it easier to find things a specific author meant to follow up on
comments for types/functions should generally start with // FunctionName/TypeName ... though I'm now seeing places I didn't update that

In general there's very few tests for the Go code here, would we like more or is there some testing that spins up the entire registry to validate things? I didn't see any makefile.

Signed-off-by: Callum Styan <[email protected]>

Parkreiner

Just commenting for now, since it sounded like there might be some more changes you want to make, but I'm okay with making these changes

And to be clear, when I'm asking a question, that's not me trying to be defensive – I'm just trying to understand how big the gap between my TypeScript way of doing things is with how a Gopher usually does stuff

Parkreiner · 2025-05-16T19:26:53Z

cmd/readmevalidation/coderresources.go


 	lineScanner := bufio.NewScanner(strings.NewReader(trimmed))
 	for lineScanner.Scan() {
 		lineNum++
-		nextLine := lineScanner.Text()
+		nextLine = lineScanner.Text()


I'm not sure I understand the point of this change, since nextLine isn't ever used outside the loop. I'm not a fan of scope pollution, and try to keep scoping as aggressively small as possible, even in the same function

Is this mostly a memory optimization?

I feel like I see code all the time in Go that looks just like what we used to have. Especially with range loops. Does the below example have the same problems as the old approach, where we're declaring new block-scoped variables on the stack once per iteration?

for i, value := range exampleSlice { // Stuff }

Is there an optimization that range loops have that doesn't exist with other loops?

Yeah this is just an optimization to reduce memory allocations. Very minor in this case since I doubt this loop has a lot of iterations, but without this a new string for nextLine is allocated for each iteration of the loop.

The Go compiler already does an optimization itself for for thing := range anotherThing to do the same optimization, assigning to the same var for each iteration rather than allocating a new one every time.

Is that actually true nowadays? I know it used to be, but they changed the range behavior to isolate variables for each declaration in Go 1.22:
https://go.dev/blog/loopvar-preview

Is Go doing escape analysis for the variable to see if it needs to be kept around for closures, and only reusing the declaration if it knows that the variable doesn't need to be long-lived? JS enforces scoped behavior for every loop that uses let and const, but I imagine it can't do those optimizations natively because it's a JIT language

Is Go doing escape analysis for the variable to see if it needs to be kept around for closures, and only reusing the declaration if it knows that the variable doesn't need to be long-lived?

I believe so, yes. It essentially looks for internal function calls or go routine calls that pass in the declared variable.

If I understand correctly the optimization will still be applied if it can detect that the variable is only used within the scope of that loop. If we wanted to, we can write a simple benchmark to explore this.

Parkreiner · 2025-05-16T19:43:45Z

cmd/readmevalidation/repostructure.go

+	var (
+		err    error
+		subDir os.FileInfo
+	)


Is this something that Go engineers typically do? I guess I just expected these parentheses declarations to be mainly used for declaring groups of related variables. Right now, the variables aren't directly related (aside from being scoped to the function), and take up more lines total now

Personal preference in some cases. The convention is either if the variables are logically related to each other, or to help with readability such as when there's multiple variables declared near eachother and you want to avoid repeating the var keyword.

In this case, I wanted both to get their respective default values and allow for turning the previous lines 17-18 into one line.

The other option was to do:

var subDir os.FileInfo var err Error

Parkreiner · 2025-05-16T19:49:04Z

cmd/readmevalidation/repostructure.go

 	}
+
+	errs := []error{}


Question I've had for a bit: does it matter whether errs is defined with an allocation or as a nil slice, since we're not serializing it as JSON?

I know that Go recommends that you don't differentiate between a nil slice and an empty, allocated slice aside from JSON output, but aside from JSON, are there ever any times when you'd want to do an allocation for a slice that might stay empty?

Question I've had for a bit: does it matter whether errs is defined with an allocation or as a nil slice, since we're not serializing it as JSON?

Do you mean empty slice, as opposed to nil slice? Rather than doing var slice := make(...)?

Usually I prefer doing make(...) with some specified length/capacity since that allows for either starting out with a slice of the size you need, or at least of some reasonable size. Every call to append when the underlying memory no longer has remaining capacity for what you're trying to append results in reallocation of the slice with 2x the current capacity.

When we don't know what the final length might be and it is possible that it could be 0, using []error{} ensures we don't allocate any space for the item storage portion of the slice.

Even if we did errs := make([]error, 0, 10) the underlying item storage would still be allocated for 10 items.

In general it's best to avoid nil slices for return values, though they can be used for function parameters/optional values.

One important point to note is that under the hood you can append to a nil slice, it will be treated as a 0 length empty slice on the first append.

To be clear, I mean the difference between:

var errs []error

Which is always created as nil and never involves upfront allocations. If I understand right, the allocations only happen behind the scenes if you pass the value into an append call, and if you never call append, the slice lives exclusively on the stack. But if you try to JSON-serialize the value directly, it'll become JSON null

and

errs := []error{}

Which always causes an allocation, just with a backing array of length 0. This is less efficient from a memory standpoint, but for JSON serialization, it's safer because it's always turned into []

Yep you're correct. The slice header exists on the stack, and with the var errs []error version we have just the slice header (24 bytes) and the pointer to the backing array is nil since we don't have one yet.

In the case of errs := []error{} the backing array is initialized on the heap, but I believe that it's using at most 8 bytes until we do some amount of appends to the slice.

IMO, and happy to discuss whether we want to use this pattern or something else, is that it's always better to not have the potential for a nil slice to avoid the potential for attempted indexing into a nil slice. Whether that be in our actual code just within a test.

cmd/readmevalidation/repostructure.go

cmd/readmevalidation/readmefiles.go

cmd/readmevalidation/contributors.go

Parkreiner · 2025-05-16T20:10:57Z

cmd/readmevalidation/contributors.go

@@ -318,19 +310,18 @@ func validateAllContributorFiles() error {
 		return err
 	}

-	log.Printf("Processing %d README files\n", len(allReadmeFiles))
+	logger.Info(context.Background(), "Processing README files", "num_files", len(allReadmeFiles))


I'm still new to structured logging. Is there any special behavior/benefit you get if you use the same key multiple times? I guess I'm just wondering how much of a concern it is to make sure you're using the same keys each time you describe the same "resource", particularly for a function call that takes a variadic slice of empty interfaces (so basically zero type-safety)

It's not the end of the world if you don't use the same key, but it does make searching for logs in some kind of log aggregation system much easier.

For example, a system I used to work on referred to the same internal tenant type within the system as variations of user, tenant, id, etc. Remembering which key was used on which logged lines complicated searches when I knew within I needed to see info for tenant="1234" but on some lines the logging was user="1234".

Again this is likely less important in the case of the registry but still a good practice.

cmd/readmevalidation/readmefiles.go

Signed-off-by: Callum Styan <[email protected]>

bcpeinhardt · 2025-05-19T14:45:01Z

@cstyan

To your point on the tests, I think this was quite quickly written validation code meant to duplicate validation done in the coder/registry-server repo, so the larger testing story of the registry is located there. That said, we should have this code live in one place where it is well tested and import it where we need it.
I wonder how many of these we can lint in our local dev process? I think it'd be awesome to come to a team preference on stuff like "prefer handling err != nil checks inline when possible" and then lint for those things.

Parkreiner · 2025-05-19T15:48:26Z

@bcpeinhardt Yeah, the thinking at the time was that because we had so many real-world README files already, we could use them as an implicit test case. The only reason why there's that one test case now is because it made development easier when I was writing the functionality

(Also, the modules repo had validation logic that straight-up didn't work and also didn't have any tests, so we figured that if nothing else, the new code would be an improvement 🫣)

cmd/readmevalidation/coderresources.go

cmd/readmevalidation/contributors.go

cmd/readmevalidation/main.go

cmd/readmevalidation/readmefiles.go

Parkreiner

I'm approving because I trust you, and I want to be clear: I hope my questions/skepticism don't come across as personal attacks towards you. I trust you – I think I might just be going through some culture shock as I figure out how to be a gopher myself

I do think it might be good to get some extra input from @f0ssel on some of the changes you were asking about before merging, though. He was the one who originally reviewed the code as I was submitting it

cstyan · 2025-05-22T16:41:49Z

I'm approving because I trust you, and I want to be clear: I hope my questions/skepticism don't come across as personal attacks towards you. I trust you – I think I might just be going through some culture shock as I figure out how to be a gopher myself

I do think it might be good to get some extra input from @f0ssel on some of the changes you were asking about before merging, though. He was the one who originally reviewed the code as I was submitting it

Not at all, I appreciate the discussion and being challenged on some of this. It's a good reminder that a) some of what Go does feels overly prescriptive or pedantic from the outside without giving clear reasons why, and it's a good reminder for me to reevaluate whether my own understanding of the why is still valid or it's just semantic convention because the style guide says so.

There may also be some style changes I've applied here that are in addition to the Go or Google conventions, IIRC we do have a Coder Go style guide somewhere, I'll have another read through of that today.

Signed-off-by: Callum Styan <[email protected]>

…ving to double escape Signed-off-by: Callum Styan <[email protected]>

…e type Signed-off-by: Callum Styan <[email protected]>

Parkreiner · 2025-05-23T13:54:27Z

To be honest, I didn't even realize that we had a backend contributing guide, since it's not in the Docs site. I thought we only had a frontend contributing guide

cstyan · 2025-05-26T21:28:58Z

To be honest, I didn't even realize that we had a backend contributing guide, since it's not in the Docs site. I thought we only had a frontend contributing guide

I may be misremembering, I can't find that document now.

Signed-off-by: Callum Styan <[email protected]>

cmd/readmevalidation/coderresources.go

Signed-off-by: Callum Styan <[email protected]>

…ke some stuff easier to read Signed-off-by: Callum Styan <[email protected]>

Signed-off-by: Callum Styan <[email protected]>

cstyan added 7 commits May 16, 2025 10:02

log -> coder/slog change

2c20ae5

Signed-off-by: Callum Styan <[email protected]>

minor style cleanup of readmefiles.go

b1d22f2

Signed-off-by: Callum Styan <[email protected]>

minor style changes in main.go

6b4093c

Signed-off-by: Callum Styan <[email protected]>

minor style changes in errors.go

097b8a2

Signed-off-by: Callum Styan <[email protected]>

minor contributors.go style changes

ed629c1

Signed-off-by: Callum Styan <[email protected]>

minor style changes in corderresources.go

85743cd

Signed-off-by: Callum Styan <[email protected]>

minor style changes in repostructure.go

6e8fc77

Signed-off-by: Callum Styan <[email protected]>

cstyan requested review from Parkreiner and bcpeinhardt May 16, 2025 19:16

Parkreiner reviewed May 16, 2025

View reviewed changes

address review feedback from Michael

4fc0a54

Signed-off-by: Callum Styan <[email protected]>

cstyan commented May 21, 2025

View reviewed changes

cmd/readmevalidation/coderresources.go Outdated Show resolved Hide resolved

cstyan commented May 21, 2025

View reviewed changes

cmd/readmevalidation/contributors.go Outdated Show resolved Hide resolved

cstyan commented May 21, 2025

View reviewed changes

cmd/readmevalidation/main.go Outdated Show resolved Hide resolved

cstyan commented May 21, 2025

View reviewed changes

cmd/readmevalidation/readmefiles.go Outdated Show resolved Hide resolved

Parkreiner approved these changes May 22, 2025

View reviewed changes

cstyan added 3 commits May 22, 2025 11:04

some more minor changes

8ab9fe9

Signed-off-by: Callum Styan <[email protected]>

these regex should be equivalent, but using backticks we can avoid ha…

11e4da3

…ving to double escape Signed-off-by: Callum Styan <[email protected]>

make each phase type string explicitly an instance the validationPhas…

333d962

…e type Signed-off-by: Callum Styan <[email protected]>

Parkreiner assigned cstyan May 23, 2025

Merge branch 'main' into callum-registry-go-style

94f0cb0

Signed-off-by: Callum Styan <[email protected]>

cstyan commented May 29, 2025

View reviewed changes

cmd/readmevalidation/coderresources.go Outdated Show resolved Hide resolved

cstyan added 3 commits May 29, 2025 13:17

give validation phases more obvious name prefixes

9fd7eb4

Signed-off-by: Callum Styan <[email protected]>

back out some non-idiomatic one lining but add helper functions to ma…

673df61

…ke some stuff easier to read Signed-off-by: Callum Styan <[email protected]>

two final fixes

518d860

Signed-off-by: Callum Styan <[email protected]>

cstyan merged commit 13a25ff into main Jun 2, 2025
4 checks passed

cstyan deleted the callum-registry-go-style branch June 2, 2025 19:23

refactor: use coder/slog + minor go style changes #107

refactor: use coder/slog + minor go style changes #107

Uh oh!

Conversation

cstyan commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Parkreiner left a comment

Choose a reason for hiding this comment

Uh oh!

Parkreiner May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Parkreiner May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Parkreiner May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bcpeinhardt commented May 19, 2025

Uh oh!

Parkreiner commented May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Parkreiner left a comment

Choose a reason for hiding this comment

Uh oh!

cstyan commented May 22, 2025

Uh oh!

Parkreiner commented May 23, 2025

Uh oh!

cstyan commented May 26, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cstyan commented May 16, 2025 •

edited

Loading

Parkreiner May 16, 2025 •

edited

Loading

Parkreiner May 22, 2025 •

edited

Loading

Parkreiner May 16, 2025 •

edited

Loading

Parkreiner commented May 19, 2025 •

edited

Loading