Add rowspan support to DocBook reader. #10981

SeanESCA · 2025-07-22T13:57:47Z

This PR adds rowspan support to the DocBook reader by reading the morerows attribute in table cells, as mentioned in #10918. Previously, the rowspan was always set to 1 regardless of this attribute. I added a test based on the table in the issue as well.

Besides that, I changed parseTable to read all the rows in thead if there are multiple rows (previously, only the first row was read). It also considers the rows in thead when setting the number of columns. Previously, it only considered those in the body, so it failed to correctly parse tables if there were more cells in a row in the head than the body.

tarleb

LGTM! I added some comments, but they are really just comments; none of them needs fixing.

tarleb · 2025-07-23T08:55:32Z

src/Text/Pandoc/Readers/DocBook.hs

                            if n > 0 then Just n else Nothing
-                      let numrows = maybe 0 maximum $ nonEmpty
-                                                    $ map length bodyrows
+                      let numrows = maybe 0 maximum $ nonEmpty 


That trailing space is probably unintentional?

tarleb · 2025-07-23T09:00:33Z

src/Text/Pandoc/Readers/DocBook.hs

 import Data.List.NonEmpty (nonEmpty)
 import Data.Maybe (catMaybes,fromMaybe,mapMaybe,maybeToList)
-import Data.Text (Text)
+import Data.Text (Text, unpack)


There are many modules that define an unpack function, so most modules in pandoc use the more explicit T.unpack. That's by no means a rule though, importing it this way is fine.

tarleb · 2025-07-23T09:13:25Z

src/Text/Pandoc/Readers/DocBook.hs

+  let rowDistance mr = do
+        case readMaybe $ unpack mr :: Maybe Int of
+          Just moreRow -> RowSpan $ moreRow + 1
+          _ -> 1


Some linting tools will complain about this wildcard use, suggesting to either use a descriptive name like _notAnInt or to be explicit and match on Nothing. This can help to avoid confusion.

Doesn't need fixing, purely FYI.

I was thinking about that, but figured it may be better to be consistent with how colspan was handled.

SeanESCA · 2025-07-23T10:26:58Z

Thanks @tarleb! I've removed the trailing space and changed the unpack import.

tarleb · 2025-07-24T21:06:43Z

The test failure is unrelated to these changes, so I'm merging this.

Thanks!

SeanESCA added 4 commits July 22, 2025 14:30

Read morerows from DocBook table cells.

1d9deee

Added test for rowspan in DocBook reader.

a77c2f6

Read rows in DocBook thead, and use thead in column count.

2eb14a1

Edited Docbook table test for multiple thead rows

a89a5c5

tarleb approved these changes Jul 23, 2025

View reviewed changes

Edited unpack import and removed trailing space.

243df28

tarleb merged commit 937e20d into jgm:main Jul 24, 2025
10 of 14 checks passed

SeanESCA deleted the docbook-rowspan branch July 25, 2025 07:40

yanntrividic pushed a commit to yanntrividic/pandoc that referenced this pull request Jul 25, 2025

DocBook reader: Add rowspan support. (jgm#10981)

53c3f88

jgm mentioned this pull request Jul 29, 2025

Docbook reader: support rowspan and colspan #10918

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add rowspan support to DocBook reader. #10981

Add rowspan support to DocBook reader. #10981

Uh oh!

SeanESCA commented Jul 22, 2025 •

edited

Loading

Uh oh!

tarleb left a comment

Uh oh!

tarleb Jul 23, 2025

Uh oh!

tarleb Jul 23, 2025

Uh oh!

tarleb Jul 23, 2025

Uh oh!

SeanESCA Jul 23, 2025

Uh oh!

SeanESCA commented Jul 23, 2025

Uh oh!

tarleb commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add rowspan support to DocBook reader. #10981

Add rowspan support to DocBook reader. #10981

Uh oh!

Conversation

SeanESCA commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tarleb left a comment

Choose a reason for hiding this comment

Uh oh!

tarleb Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

tarleb Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

tarleb Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

SeanESCA Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

SeanESCA commented Jul 23, 2025

Uh oh!

tarleb commented Jul 24, 2025

Uh oh!

Uh oh!

Uh oh!

SeanESCA commented Jul 22, 2025 •

edited

Loading