docs: add json to full text search document #4865

wojiaodoubao · 2025-10-01T05:12:22Z

No description provided.

wojiaodoubao · 2025-10-02T10:23:32Z

Hi @jackye1995, could you help review this document update when you have time? Thanks very much!

jackye1995 · 2025-10-02T16:43:16Z

docs/src/format/table/index/scalar/fts.md


-The full text search index supports multiple tokenizer types for different text processing needs:
+The full text search index supports multiple tokenizer types for different text processing needs.
+There are two different tokenizer configurations: ```lance_tokenizer``` and ```base_tokenizer```.


No need for triple backquotes; single backquotes are fine.

jackye1995 · 2025-10-02T16:47:07Z

docs/src/format/table/index/scalar/fts.md

+
+#### Text Tokenizer
+Text Tokenizer is responsible for handling TEXT-type data, which is Utf8, LargeUtf8 or List of them in arrow format.
+The Text Tokenizer behaves consistently in both "query" and "document parsing" scenarios, which means that if a document


I don't think we need double quotes around "query" and "document parsing".

The same comment applies to a few other cases below.
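
For readers of this thread, the consistency property in the Text Tokenizer paragraph (the same tokenization on the document side and the query side) can be illustrated with a toy inverted index. This is only a sketch: `tokenize`, `build_index`, and `search` are hypothetical helpers, not Lance APIs, and the ASCII-only lowercase-and-split rule is an assumption, not the behavior of any Lance tokenizer.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Hypothetical tokenizer: lowercase, then split on non-alphanumeric ASCII."""
    return [tok for tok in re.split(r"[^0-9a-z]+", text.lower()) if tok]


def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for tok in tokenize(text):
            index[tok].add(doc_id)
    return index


def search(index, query):
    """Tokenize the query with the same function used for documents."""
    hits = set()
    for tok in tokenize(query):
        hits |= index.get(tok, set())
    return hits


docs = ["Lance is a columnar format.", "Arrow is an in-memory format."]
index = build_index(docs)

# Both queries reduce to the token "lance", so both find document 0.
print(search(index, "lance"))
print(search(index, "LANCE."))
```

Because documents and queries pass through the same `tokenize` function, punctuation and casing in the query do not affect whether the document is found.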

jackye1995 · 2025-10-02T16:47:26Z

docs/src/format/table/index/scalar/fts.md

+#### Text Tokenizer
+Text Tokenizer is responsible for handling TEXT-type data, which is Utf8, LargeUtf8 or List of them in arrow format.
+The Text Tokenizer behaves consistently in both "query" and "document parsing" scenarios, which means that if a document
+contains the word "lance," we can retrieve it using a query with "lance."


"lance", and "lance".?

jackye1995 · 2025-10-02T16:50:05Z

docs/src/format/table/index/scalar/fts.md

+age,number,30
+address.city,str,San
+address.city,str,Francisco
+address.zip,number,94102


We should define and handle the case where the document path contains `.` or `:`, for both parsing and querying.

Thanks for the nice suggestion! I have updated the example, adding `.` and `:` to the JSON text with the corresponding triplets. I also added a unit test for `.` and `:`.
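
To make the concern about `.` and `:` in paths concrete, here is a minimal sketch of flattening a JSON document into `path,type,token` triplets like the ones in the diff above. Everything in it is an assumption for illustration: `flatten`, `naive_path`, `escaped_path`, and `render` are hypothetical names, whitespace splitting stands in for the real tokenizer, the `bool` type name is invented, and the backslash escaping is just one possible scheme, not necessarily what this PR implements.

```python
import re


def flatten(value, path=()):
    """Yield (path_segments, type_name, token) triplets from parsed JSON."""
    if isinstance(value, dict):
        for key, child in value.items():
            yield from flatten(child, path + (key,))
    elif isinstance(value, list):
        for child in value:
            yield from flatten(child, path)
    elif isinstance(value, bool):
        # Checked before numbers because bool is a subclass of int in Python.
        # The "bool" type name is an assumption, not taken from the PR.
        yield path, "bool", str(value).lower()
    elif isinstance(value, (int, float)):
        yield path, "number", str(value)
    elif isinstance(value, str):
        # Whitespace splitting stands in for the real text tokenizer.
        for token in value.split():
            yield path, "str", token
    # None values are skipped in this sketch.


def naive_path(segments):
    """Join segments with '.', ignoring any '.' or ':' inside a key."""
    return ".".join(segments)


def escaped_path(segments):
    """Hypothetical scheme: backslash-escape '\\', '.' and ':' in each key."""
    return ".".join(re.sub(r"([\\.:])", r"\\\1", seg) for seg in segments)


def render(triplet, join=naive_path):
    """Format one triplet as a 'path,type,token' line."""
    segments, type_name, token = triplet
    return f"{join(segments)},{type_name},{token}"


nested = {"age": 30, "address": {"city": "San Francisco", "zip": 94102}}
for triplet in flatten(nested):
    print(render(triplet))

# A nested key and a literal dotted key collide under the naive join ...
print(naive_path(("address", "city")) == naive_path(("address.city",)))
# ... but stay distinct once special characters are escaped.
print(escaped_path(("address", "city")) == escaped_path(("address.city",)))
```

With the naive join, `{"address": {"city": ...}}` and `{"address.city": ...}` produce the same path string, which is why the path syntax needs a defined rule for these characters on both the parsing and the querying side.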

github-actions bot added the documentation Improvements or additions to documentation label Oct 1, 2025

wojiaodoubao force-pushed the fts-json-doc branch from 1397aa7 to 98dbbab Compare October 1, 2025 06:36

wojiaodoubao mentioned this pull request Oct 2, 2025

Add full text json index #4749

Open

jackye1995 reviewed Oct 2, 2025

wojiaodoubao force-pushed the fts-json-doc branch from 98dbbab to 7fb8468 Compare October 3, 2025 08:24

github-actions bot added the python label Oct 3, 2025

wojiaodoubao force-pushed the fts-json-doc branch 2 times, most recently from e1bbd02 to 9e66d90 Compare October 3, 2025 08:52

docs: add json to full text search document

d7c83b0

wojiaodoubao force-pushed the fts-json-doc branch from 9e66d90 to d7c83b0 Compare October 3, 2025 08:54

fmt

b9c42fa
