[API Proposal]: DbDataReader - span based column name APIs #113741

mgravell · 2025-03-20T19:01:04Z

Background and motivation

Column names are an inherent part of DB reads. Currently, DbDataReader exposes multiple APIs for discussing column names, the most obvious being:

public abstract class DbDataReader
{
    public abstract string GetName(int ordinal);
    public abstract int GetOrdinal(string name);
}

While it is possible to ignore the column names and rely purely on ordinals, a lot of code - especially ORM code - uses the column names to assert the intent (think "select * from orders where region=@region" - what is the column order?)

API Proposal

Suggestion:

public abstract class DbDataReader
{
    public abstract string GetName(int ordinal);
    public abstract int GetOrdinal(string name);

+    public virtual ReadOnlySpan<char> GetNameSpan(int ordinal) => GetName(ordinal).AsSpan();
+    public virtual int GetOrdinal(ReadOnlySpan<char> name) => GetOrdinal(name.ToString());
}

This would allow implementations to use pooled char[] data for the names, only lazily materializing a string if the GetName API is used. Callers such as ORMs could update to use span-based testing, completely avoiding the string usage.

I would happily update Dapper as a prototype.

API Usage

This is an existing Dapper (ORM) usage, taken from AOT generator output:

 public override object? Tokenize(global::System.Data.Common.DbDataReader reader, global::System.Span<int> tokens, int columnOffset)
            {
                for (int i = 0; i < tokens.Length; i++)
                {
                    int token = -1;
                    var name = reader.GetName(columnOffset);
                    var type = reader.GetFieldType(columnOffset);
                    switch (NormalizedHash(name))
                    {
                        case 926444256U when NormalizedEquals(name, "id"):
                            token = type == typeof(int) ? 0 : 3; // two tokens for right-typed and type-flexible
                            break;
                        case 2369371622U when NormalizedEquals(name, "name"):
                            token = type == typeof(string) ? 1 : 4;
                            break;
                        case 4237030186U when NormalizedEquals(name, "birthdate"):
                            token = type == typeof(DateOnly) ? 2 : 5;
                            break;

                    }
                    tokens[i] = token;
                    columnOffset++;

                }
                return null;
            }

The complete change here would be to the line:

- var name = reader.GetName(columnOffset);
+ var name = reader.GetNameSpan(columnOffset);

et voila, zero strings; if the provider supports the usage! obviously this also requires

If in doubt over the .ToString() in the base method: I'll lose GetOrdinal in a heartbeat to get GetNameSpan !

Alternative Designs

No response

Risks

No response

The text was updated successfully, but these errors were encountered:

mgravell · 2025-03-20T19:01:42Z

/cc @roji

dotnet-policy-service · 2025-03-20T19:02:05Z

Tagging subscribers to this area: @roji, @ajcvickers
See info in area-owners.md if you want to be subscribed.

roji · 2025-03-21T07:16:47Z

@mgravell thanks for the suggestion.

In general, we've viewed DbDataReader's name-based APIs as "low-perf" convenience APIs. In other words, a very-high-perf conscious consumer would be expected to just always use ordinals everywhere, and not deal with column names at all; after all, that user is also setting the SQL on the command which produced the reader, so they should be able to know the ordinals for reading out the results.

Is there a reason/scenario that a (high-perf) consumer of DbDataReader would have to deal with column names?

See this somewhat related recent conversation with @Wraith2

mgravell · 2025-03-22T12:58:09Z

@roji imagine the context of Dapper. Right now, in current versions of Dapper, it doesn't attempt to pre-process the SQL to understand context and check whether it can use pure ordinal fetch - it checks the column names (not least, because the query could involve *).

Now, if we parse the query fully, we can check that - and indeed that is planned for TSQL in AOT mode because we have a build-time TSQL parser already, that we can use. However:

we don't have this capability for every SQL variant
for runtime-only queries, we don't want to spend the time checking

So: having the ability to efficiently query the column names could save a considerable number of allocations.

…ance, so let's use the ordinal positions of columns instead of their names to improve performance, even though it reduces the readability of the code. (dotnet/runtime#113741 (comment))

adamburgess · 2025-03-28T16:20:40Z

Executing Stored Procedures would also benefit from this optimisation as the column orders are not directly controllable and must be looked up before reading the result.

mgravell added the api-suggestion Early API idea and discussion, it is NOT ready for implementation label Mar 20, 2025

dotnet-issue-labeler bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Mar 20, 2025

mgravell added area-System.Data api-ready-for-review API is ready for review, it is NOT ready for implementation and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Mar 20, 2025

dotnet-policy-service bot added the untriaged New issue has not been triaged by the area owner label Mar 20, 2025

teo-tsirpanis removed the api-suggestion Early API idea and discussion, it is NOT ready for implementation label Mar 21, 2025

mgravell closed this as completed Apr 30, 2025

dotnet-policy-service bot removed the untriaged New issue has not been triaged by the area owner label Apr 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[API Proposal]: DbDataReader - span based column name APIs #113741

[API Proposal]: DbDataReader - span based column name APIs #113741

mgravell commented Mar 20, 2025 •

edited

Loading

mgravell commented Mar 20, 2025

dotnet-policy-service bot commented Mar 20, 2025

roji commented Mar 21, 2025

mgravell commented Mar 22, 2025

adamburgess commented Mar 28, 2025 •

edited

Loading

[API Proposal]: DbDataReader - span based column name APIs #113741

[API Proposal]: DbDataReader - span based column name APIs #113741

Comments

mgravell commented Mar 20, 2025 • edited Loading

Background and motivation

API Proposal

API Usage

Alternative Designs

Risks

mgravell commented Mar 20, 2025

dotnet-policy-service bot commented Mar 20, 2025

roji commented Mar 21, 2025

mgravell commented Mar 22, 2025

adamburgess commented Mar 28, 2025 • edited Loading

mgravell commented Mar 20, 2025 •

edited

Loading

adamburgess commented Mar 28, 2025 •

edited

Loading