Add first draft of default attribute definitions #1098

eemeli · 2025-09-08T12:11:23Z

Adds an initial set of expression, markup, and message attribute definitions.

The proposed attributes are drawn from:

XLIFF 2.2
The messages.json web extension definition for placeholders.example
Enumerate supported metadata/properties for messages, sections & resources eemeli/message-resource-wg#19

As noted in the text, this is not intended as a final list, but as a starting point. The text is not being currently proposed to be normative, but we could change that later.

aphillips

Good start. Lots of nit-picky comments.

Maybe a good question is: should these be directly incorporated? Or should all of these XLIFFy things be namespaced? Some of what XLIFF does doesn't apply to UMF messages and some of it would be much better on a message resource level (instead of cluttering up the message itself).

spec/attributes/README.md

aphillips · 2025-09-08T16:17:49Z

spec/attributes/README.md

+As all _attributes_ with _reserved identifiers_ are reserved,
+definitions are provided here for common _attribute_ use cases.
+Custom _attributes_ SHOULD use a _custom identifier_,
+preferably one with an appropriate _namespace_.


This paragraph feels weird? It's unclear if we're trying to say that the "definitions" found here are normative. This also tangles with the reserves/custom identifier bear in a novel way.

Perhaps:

Suggested change

As all _attributes_ with _reserved identifiers_ are reserved,

definitions are provided here for common _attribute_ use cases.

Custom _attributes_ SHOULD use a _custom identifier_,

preferably one with an appropriate _namespace_.

_Attributes_ defined by this specification use _reserved identifiers_.

Custom _attributes_ MUST use a _custom identifier_.

The use of a _namespace_ is RECOMMENDED for implementation-defined

or domain-specific _attributes_.

The MUST you propose is actually stronger than what we've currently in the spec, where the strongest language we use is

message-format-wg/spec/syntax.md

Lines 911 to 914 in 7e65f6e

Implementers and authors of _functions_ and _messages_,

including _functions_, _options_, and _variables_,

SHOULD avoid creating _names_ that could produce confusion or harm usability

by choosing _names_ consistent with the following guidelines.

This paragraph should is intended to match/recall what's already elsewhere in the spec, rather than adding new normative text. It could be dropped completely, if that would be clearer?

The MUST is stronger, because it is effectively authoring advice. You're right that it probably should be a SHOULD. Perhaps something like:

Use a custom identifier for other attributes.

aphillips · 2025-09-08T16:20:30Z

spec/attributes/README.md

+
+#### @translate
+
+_Value:_ `yes` or `no`.


Indicate that yes is default?

Is there a reason attributes don't follow a similar structure to functions and their options here?

I don't think we've agreement that yes is the default. In fact, for expressions, I would think that the general default might in fact be no to indicate that a translator is not expected to make any changes to the expression.

Considering this a bit more, maybe something like translate=input or translate=|input,minimumFractionDigits| would be better? That would indicate which parts are expected to be translatable.

The default value is no when the attribute is not present, but yes when the attribute is present and has no value, right?

I don't like the values yes/no, but they are inherited from XLIFF (and its friends, such as ITS) and we should probably remain consistent with them (for portability at least)

Ah, that's a slightly different undrstanding of "default" than I'd had -- as in, the value that's applied if the attribute is not present at all.

I don't hate the yes/no as they're relatively legible and are perhaps easier to extend with other enum values than e.g. true/false would be. But as they're already in use by XLIFF, we should use the same values.

aphillips · 2025-09-08T16:21:23Z

spec/attributes/README.md

+
+Indicates whether or not the _markup_ and its contents can be re-ordered.
+
+#### @comment


Why not just permit the "global" attributes on markup?

I don't understand what this means.

You're repeating attributes defined above. Why not make those like @comment global to both expressions and markup?

That seems like an editorial fix we could apply later, if it does hold that the annotations continue to match on expressions and markup.

It would be a bad idea for identically-named attributes to diverge. The sets aren't identical, of course.

aphillips · 2025-09-08T16:23:14Z

spec/attributes/README.md

+
+#### @max-length
+
+_Value:_ A strictly positive integer, followed by a space, followed by one of the following:


digit size option?

That's limited to max 99, and we need to allow for limits greater than that.

spec/attributes/README.md

aphillips · 2025-09-08T16:24:14Z

spec/attributes/README.md

+_Value:_ A strictly positive integer, followed by a space, followed by one of the following:
+- `chars`
+- `bytes`
+- `lines`


Good luck with this one.

As in, we should not include it?

Measuring bytes will depend on some character encoding somewhere. Without an indication of the encoding (which this doesn't provide), there is no way to perform the measurement.

(FWIW, you're missing graphemes, which is another measurement (approximately "screen positions", but only approximately so).)

Lines depends on... font, font size, pixel width, line-breaking, hyphenation (insert more here) and are even harder to define that bytes.

Length limitations are a "fact of life" in localization, but badly defined mechanisms for them are not that helpful.

One option would be to leave out the units, and to let the implementation figure out what the limit means, something in the overlap of characters/code points/graphemes.

spec/attributes/README.md

Co-authored-by: Addison Phillips <[email protected]>

eemeli · 2025-09-09T09:47:43Z

Maybe a good question is: should these be directly incorporated? Or should all of these XLIFFy things be namespaced? Some of what XLIFF does doesn't apply to UMF messages and some of it would be much better on a message resource level (instead of cluttering up the message itself).

During yesterday's call, @mihnita also expressed concern regarding cluttering up a message with multiple attributes. His thought was that it would often be preferable to attach a u:id to an expression or markup, and refer to that from a separate message-level block to attach attribute-y metadata to the relevant placeholder(s).

To me, this speaks of a need to have that capability also be well defined, so that it can be ergonomically done across resource formats. In other words, I think we need a JavaDoc-y syntax for message-level attributes.

Add first draft of default attribute definitions

6bc7fc2

eemeli added the Agenda+ Requested for upcoming teleconference label Sep 8, 2025

aphillips reviewed Sep 8, 2025

View reviewed changes

eemeli commented Sep 9, 2025

View reviewed changes

spec/attributes/README.md Show resolved Hide resolved

Apply suggestions from code review

39911f2

Co-authored-by: Addison Phillips <[email protected]>

eemeli requested review from aphillips and mihnita September 9, 2025 09:47

	Implementers and authors of _functions_ and _messages_,
	including _functions_, _options_, and _variables_,
	SHOULD avoid creating _names_ that could produce confusion or harm usability
	by choosing _names_ consistent with the following guidelines.


		Indicates whether or not the _markup_ and its contents can be re-ordered.

		#### @comment


		#### @max-length

		_Value:_ A strictly positive integer, followed by a space, followed by one of the following:

Uh oh!

Add first draft of default attribute definitions #1098

Are you sure you want to change the base?

Add first draft of default attribute definitions #1098

Uh oh!

Conversation

eemeli commented Sep 8, 2025

Uh oh!

aphillips left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

aphillips Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

eemeli commented Sep 9, 2025

Uh oh!

Uh oh!

aphillips Sep 9, 2025 •

edited

Loading