Conditional breaks are lost when converting from docx #2415

Closed
soerface opened this issue Sep 25, 2015 · 2 comments · Fixed by #2432
Comments

@soerface

If I have a docx document which contains conditional breaks (press Ctrl + - in MS Word 2013), those are lost when converting to markdown or to HTML.

pandoc -f docx -t html -o out.html input.docx

I expected the output to contain &shy; entities. Instead, nothing at all was emitted for the conditional breaks; they are simply ignored.

@mb21
Collaborator

mb21 commented Sep 25, 2015

Unfortunately, Word doesn't use the Unicode character 'SOFT HYPHEN' (U+00AD) but instead creates its own XML element <w:softHyphen/>. The docx reader should be adjusted to fix this.
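
To illustrate the mapping described above, here is a minimal sketch in Python. It is not pandoc's docx reader (which is written in Haskell) and not the eventual fix. The helper `run_text` and the `SAMPLE_RUN` fragment are hypothetical and made up for illustration. The sketch assumes the soft hyphen appears as a `<w:softHyphen/>` child of a `<w:r>` run, next to the `<w:t>` text elements.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by docx documents.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN, rendered as &shy; in HTML


def run_text(run):
    """Concatenate the text of one <w:r> run (hypothetical helper).

    <w:t> children contribute their text; <w:softHyphen/> children are
    mapped to U+00AD instead of being dropped. Other children are ignored.
    """
    parts = []
    for child in run:
        if child.tag == f"{{{W_NS}}}t":
            parts.append(child.text or "")
        elif child.tag == f"{{{W_NS}}}softHyphen":
            parts.append(SOFT_HYPHEN)
    return "".join(parts)


# Hand-written sample run, assumed to resemble what Word emits for
# "conditional" with an optional break after "con".
SAMPLE_RUN = (
    f'<w:r xmlns:w="{W_NS}">'
    "<w:t>con</w:t><w:softHyphen/><w:t>ditional</w:t>"
    "</w:r>"
)

text = run_text(ET.fromstring(SAMPLE_RUN))
print(repr(text))
```

A reader that only collects `<w:t>` text would return "conditional" with no break opportunity, which matches the behaviour reported in this issue.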

@jgm
Owner

jgm commented Sep 25, 2015

@jkr - this sounds like a pretty easy thing to fix.
