How to read figure caption from DOCX #9391

Closed
jgm opened this issue Jan 30, 2024 Discussed in #9390 · 0 comments


jgm commented Jan 30, 2024

Discussed in #9390

Originally posted by rgaiacs January 30, 2024
Pandoc supports Microsoft Word's native table captions. For example, when mwe-table.docx

Screenshot 2024-01-30 145920

is converted using pandoc --from docx --to html mwe-table.docx, it produces

<p>Lorem ipsum.</p>
<table>
<caption><p>Awesome table</p></caption>
<colgroup>
<col style="width: 50%" />
<col style="width: 50%" />
</colgroup>
<thead>
<tr class="header">
<th>A</th>
<th>B</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td>1</td>
<td>2</td>
</tr>
</tbody>
</table>

Does Pandoc support reading Microsoft Word's native figure captions? For example, when mwe-img.docx

Screenshot 2024-01-30 155653

is converted using pandoc --from docx --to html mwe-img.docx, it produces

<p>Lorem ipsum.</p>
<p><img src="media/image1.png"
style="width:1.10263in;height:1.10263in" /></p>
<p>Figure Blue square.</p>

instead of

<p>Lorem ipsum.</p>
<figure>
   <img src="media/image1.png"/>
   <figcaption>
      Blue square.
   </figcaption>
</figure>
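Until the docx reader handles this natively, one possible workaround is to post-process pandoc's JSON AST. The following is only a minimal sketch, not pandoc's own behavior. It assumes the pandoc 3.x JSON layout, where a `Figure` block is `[attr, [short_caption, caption_blocks], content_blocks]`. It also assumes the caption arrives as a plain paragraph that immediately follows the image paragraph and starts with the word "Figure", as in the output above. The names `is_image_para`, `caption_inlines`, `merge_figures`, and `convert` are hypothetical helpers, not part of pandoc.

```python
import json


def is_image_para(block):
    """True if the block is a Para whose only inline is an Image."""
    return (
        block.get("t") == "Para"
        and len(block["c"]) == 1
        and block["c"][0].get("t") == "Image"
    )


def caption_inlines(block, label="Figure"):
    """Return the caption inlines with the leading label removed.

    Returns None when the block is not a Para starting with `label`.
    Assumption: Word's caption label survives as a single Str inline.
    """
    if block.get("t") != "Para":
        return None
    inlines = block["c"]
    if not inlines or inlines[0] != {"t": "Str", "c": label}:
        return None
    rest = inlines[1:]
    # Drop the whitespace that separated the label from the caption text.
    while rest and rest[0].get("t") in ("Space", "SoftBreak"):
        rest = rest[1:]
    return rest


def merge_figures(blocks, label="Figure"):
    """Fold each image paragraph and the caption paragraph after it
    into a single Figure block; leave every other block unchanged."""
    out = []
    i = 0
    while i < len(blocks):
        block = blocks[i]
        if is_image_para(block) and i + 1 < len(blocks):
            caption = caption_inlines(blocks[i + 1], label)
            if caption is not None:
                out.append({
                    "t": "Figure",
                    "c": [
                        ["", [], []],                          # empty attr
                        [None, [{"t": "Plain", "c": caption}]],  # caption
                        [{"t": "Plain", "c": block["c"]}],     # the image
                    ],
                })
                i += 2  # skip the caption paragraph we just consumed
                continue
        out.append(block)
        i += 1
    return out


def convert(json_text, label="Figure"):
    """Take a pandoc JSON document as text, return the rewritten text."""
    doc = json.loads(json_text)
    doc["blocks"] = merge_figures(doc["blocks"], label)
    return json.dumps(doc)
```

A small wrapper that passes standard input through `convert` could sit between `pandoc --from docx --to json` and `pandoc --from json --to html`. The heuristic only looks at top-level blocks and will misfire on an ordinary paragraph that happens to begin with "Figure" right after an image, so treat it as a stopgap.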