BUG: fix to_latex() when using MultiIndex with NaN in (#14249) #19910

tomneep · 2018-02-26T16:26:23Z

closes BUG: NaN in multi-index and to_latex() #14249
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

This attempts to close #14249, which actually reports two related issues (one in the issue itself, one in the linked SO question) with to_latex() when the MultiIndex contains a NaN. The output is now more consistent with what you'd get from to_csv() or to_html().
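
For illustration, here is a minimal plain-Python sketch of how the two symptoms (an IndexError and wrong labels) can arise. It assumes the formatter looks up each row's label by indexing a level's unique values with that row's integer code, and it relies on pandas marking a missing entry with code -1. The name `naive_row_labels` and the blank-width step are hypothetical stand-ins, not the actual pandas source.

```python
# Toy model of a single MultiIndex level (an assumption for illustration):
# a list of unique values plus one integer code per row, where -1 means NaN.

def naive_row_labels(level_values, codes):
    """Look up each row's label by indexing the unique values directly."""
    # Width of a blank cell, taken from the first unique value.
    # Raises IndexError when the level has no unique values at all.
    blank_width = len(str(level_values[0]))
    # Python's negative indexing turns code -1 into "the last value".
    labels = [str(level_values[code]) for code in codes]
    return blank_width, labels


# Symptom 1: the NaN row (code -1) silently shows the last level value.
print(naive_row_labels(["a", "b"], [0, -1, 1]))  # (1, ['a', 'b', 'b'])

# Symptom 2: an all-NaN level has no unique values, so the lookup fails.
try:
    naive_row_labels([], [-1, -1])
except IndexError as exc:
    print("IndexError:", exc)
```

In this model the middle row should have been rendered as NaN, yet it comes out as 'b', which is the kind of silent replacement described in the SO question.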

jreback · 2018-02-27T01:18:44Z

@tomneep are you testing both issues in the issue?

jreback · 2018-02-27T01:18:54Z

cc @toobaz if any comments

tomneep · 2018-02-27T07:53:02Z

@jreback yes, I only added one test but parametrized it to cover both of the issues

toobaz · 2018-03-04T12:52:59Z

> cc @toobaz if any comments

Sorry for the delay. I think the patch works as intended, but I also think that replacing lev2 = lev.format() and the whole new blank definition with

lev2 = self.frame.index.get_level_values(i).format()

would be cleaner.
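
A rough plain-Python sketch of why expanding the level to one value per row first (conceptually what get_level_values(i) does) avoids special cases. The helper names `level_values_per_row` and `format_values`, and the `na_rep` default, are assumptions made for this example, not pandas internals.

```python
import math


def level_values_per_row(level_values, codes):
    """Expand (unique values, codes) into one value per row; code -1 becomes NaN."""
    return [math.nan if code == -1 else level_values[code] for code in codes]


def format_values(values, na_rep="NaN"):
    """Format already-expanded row values, rendering NaN as na_rep."""
    return [
        na_rep if isinstance(value, float) and math.isnan(value) else str(value)
        for value in values
    ]


# A level with one missing entry: the NaN row keeps its own label.
print(format_values(level_values_per_row(["a", "b"], [0, -1, 1])))  # ['a', 'NaN', 'b']

# An all-NaN level: no unique values, but still one label per row.
print(format_values(level_values_per_row([], [-1, -1])))  # ['NaN', 'NaN']
```

Because every row already carries its own value, there is no lookup by code at formatting time and no need to derive a blank width from the first unique value.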

toobaz · 2018-03-04T12:54:13Z

doc/source/whatsnew/v0.23.0.txt

@@ -864,6 +864,7 @@ I/O
 - Bug in :func:`read_csv` where missing values were not being handled properly when ``keep_default_na=False`` with dictionary ``na_values`` (:issue:`19227`)
 - Bug in :func:`read_sas` where a file with 0 variables gave an ``AttributeError`` incorrectly. Now it gives an ``EmptyDataError`` (:issue:`18184`)
 - Bug in :func:`DataFrame.to_latex()` where pairs of braces meant to serve as invisible placeholders were escaped (:issue:`18667`)
+- Bug in :func:`DataFrame.to_latex()` where a ``NaN`` in a ``MultiIndex`` would cause an ``IndexError`` or incorrect output (:issue:`14249`)


where an all-NaN level in a MultiIndex

(I think)

In the SO link in #14249 there is a case where NaNs are silently replaced with other non-NaN values in the level. This fixes both that issue and the one actually reported.

tomneep · 2018-03-06T09:30:49Z

Hi @toobaz,

Thanks for checking the patch. I wanted to keep the changes as minimal as possible, as it is my first contribution.
I agree that this isn't so clean, so I had a go at refactoring that section of code; see d63e88f.

The tests all pass so if you prefer this approach (I certainly do) I can make a PR for that commit instead (maybe after a bit of tidying).

tomneep · 2018-03-08T10:25:58Z

Hi @jreback @toobaz
Since I see that #20032 is also addressing the same area of code as this PR, I'd like to get some clarification to see if we can move this forward.

The rewrite I mention in the comment above closes the issue this PR initially aimed to close as well as #19981, and an (as far as I can tell unreported) issue where to_latex(sparsify=False) still sparsifies on master. My impression is that this would be a good place for #20032 to start making changes, but I would like your opinion.

The main thing I don't know is whether I should create a new PR, since the changes are now more substantial than this initial one, or whether I can just keep using this one and commit the changes to this branch.

jreback · 2018-03-14T11:03:15Z

can you rebase. we moved some files around.

codecov · 2018-03-14T12:17:05Z

Codecov Report

Merging #19910 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master   #19910      +/-   ##
==========================================
+ Coverage   91.77%   91.78%   +<.01%     
==========================================
  Files         152      152              
  Lines       49185    49186       +1     
==========================================
+ Hits        45140    45143       +3     
+ Misses       4045     4043       -2

Flag	Coverage Δ
#multiple	`90.16% <100%> (ø)`	⬆️
#single	`41.85% <0%> (-0.01%)`	⬇️

Impacted Files	Coverage Δ
pandas/io/formats/latex.py	`100% <100%> (ø)`	⬆️
pandas/util/testing.py	`83.95% <0%> (+0.2%)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update cad6dc7...2a72b60.

tomneep · 2018-03-14T13:23:01Z

Ok, rebased. I also rebased my alternative branch that tidies this portion of the code up: c0daa15

My preference would be to merge that branch instead (in which case a couple more tests could be added) but let me know which you prefer.

toobaz · 2018-04-18T04:33:54Z

@tomneep again sorry for the delay. I also think the alternative branch is much better: please feel free to open another PR.

(Please also clarify whether that branch is incompatible with/alternative to #20032, or just requires some merge work)

tomneep · 2018-04-23T16:01:45Z

I've opened #20797 for the improved branch.

Perhaps it will mean some of #20032 won't be needed, but I don't think this is completely incompatible with it.

jreback added Bug IO LaTeX to_latex labels Feb 27, 2018

toobaz reviewed Mar 4, 2018

tomneep mentioned this pull request Mar 8, 2018

Latex bugs #20032

Closed

4 tasks

BUG: fix to_latex() when using MultiIndex with NaN in (#14249)

2a72b60

tomneep force-pushed the to_latex_nan_fix branch from 5dbe1af to 2a72b60 Compare March 14, 2018 12:16

tomneep mentioned this pull request Apr 23, 2018

BUG: fix to_latex() when using MultiIndex with NaN in (#14249) (v2) #20797

Merged

7 tasks

tomneep closed this Apr 23, 2018
