BUG: Don't parse inline quotes in skipped lines #14514

Merged

Conversation

gfyoung
Member

@gfyoung gfyoung commented Oct 27, 2016

Title is self-explanatory. Closes #14459.
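For readers without the linked issue at hand, here is a minimal sketch of the kind of input the title refers to: a row passed to `skiprows` that itself contains quote characters inside a quoted field. The sample data and variable names are illustrative assumptions, not taken from the diff.

```python
from io import StringIO

import pandas as pd

# Illustrative data (an assumption, not copied from the PR): every data row
# has single quotes nested inside a double-quoted field.
data = """id,text,num_lines
1,"line '11' line 12",2
2,"line '21' line 22",2
3,"line '31' line 32",1"""

# Line 0 is the header, so skiprows=[1] drops the first data row.
# The quotes inside that skipped row should not affect how later rows parse.
df = pd.read_csv(StringIO(data), skiprows=[1])
print(df)
```

On a pandas version that includes this fix, the resulting frame should hold only the rows with `id` 2 and 3, with the inline single quotes in `text` left untouched.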

@jorisvandenbossche jorisvandenbossche added Bug IO CSV read_csv, to_csv labels Oct 27, 2016
@jorisvandenbossche jorisvandenbossche added this to the 0.19.1 milestone Oct 27, 2016
@jorisvandenbossche
Member

@gfyoung Thanks a lot!
Looks good to me.

Do we have benchmarks for skiprows?

@gfyoung
Member Author

gfyoung commented Oct 27, 2016

@jorisvandenbossche : Yes, we do in io_bench.py. I didn't see any major performance hits.

@jorisvandenbossche
Member

OK, thanks.

@jreback or @chris-b1 Can somebody else give a look as well?

@gfyoung gfyoung force-pushed the skiprows-quotechar-patch branch from 5a0556e to 4fb55f6 Compare October 27, 2016 16:01
@codecov-io

codecov-io commented Oct 28, 2016

Current coverage is 85.26% (diff: 100%)

Merging #14514 into master will not change coverage

@@             master     #14514   diff @@
==========================================
  Files           140        140          
  Lines         50672      50672          
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
  Hits          43208      43208          
  Misses         7464       7464          
  Partials          0          0          

Powered by Codecov. Last update 7f5a45c...2e41dab

@gfyoung gfyoung force-pushed the skiprows-quotechar-patch branch from 4fb55f6 to 1662b8f Compare October 28, 2016 17:21
@gfyoung gfyoung force-pushed the skiprows-quotechar-patch branch from 1662b8f to 2e41dab Compare October 30, 2016 19:03
@jreback
Contributor

jreback commented Oct 31, 2016

lgtm.

@gfyoung can you just do a run of the csv asv's just for a quick check.

@gfyoung
Member Author

gfyoung commented Oct 31, 2016

@jreback : As I alluded to above, I saw no major performance hits for the csv benchmarks.

@jorisvandenbossche jorisvandenbossche merged commit b088112 into pandas-dev:master Oct 31, 2016
@jorisvandenbossche
Member

@gfyoung Thanks!

@gfyoung gfyoung deleted the skiprows-quotechar-patch branch October 31, 2016 20:45
jorisvandenbossche pushed a commit to jorisvandenbossche/pandas that referenced this pull request Nov 2, 2016
yarikoptic added a commit to neurodebian/pandas that referenced this pull request Nov 18, 2016
Version 0.19.1

* tag 'v0.19.1': (43 commits)
  RLS: v0.19.1
  DOC: update whatsnew/release notes for 0.19.1 (pandas-dev#14573)
  [Backport pandas-dev#14545] BUG/API: Index.append with mixed object/Categorical indices (pandas-dev#14545)
  DOC: rst fixes
  [Backport pandas-dev#14567] DEPR: add deprecation warning for com.array_equivalent (pandas-dev#14567)
  [Backport pandas-dev#14551] PERF: casting loc to labels dtype before searchsorted (pandas-dev#14551)
  [Backport pandas-dev#14536] BUG: DataFrame.quantile with NaNs (GH14357) (pandas-dev#14536)
  [Backport pandas-dev#14520] BUG: don't close user-provided file handles in C parser (GH14418) (pandas-dev#14520)
  [Backport pandas-dev#14392] BUG: Dataframe constructor when given dict with None value (pandas-dev#14392)
  [Backport pandas-dev#14514] BUG: Don't parse inline quotes in skipped lines (pandas-dev#14514)
  [Bacport pandas-dev#14543] BUG: tseries ceil doc fix (pandas-dev#14543)
  [Backport pandas-dev#14541] DOC: Simplify the gbq integration testing procedure for contributors (pandas-dev#14541)
  [Backport pandas-dev#14527] BUG/ERR: raise correct error when sql driver is not installed (pandas-dev#14527)
  [Backport pandas-dev#14501] BUG: fix DatetimeIndex._maybe_cast_slice_bound for empty index (GH14354) (pandas-dev#14501)
  [Backport pandas-dev#14442] DOC: Expand on reference docs for read_json() (pandas-dev#14442)
  BLD: fix 3.4 build for cython to 0.24.1
  [Backport pandas-dev#14492] BUG: Accept unicode quotechars again in pd.read_csv
  [Backport pandas-dev#14496] BLD: Support Cython 0.25
  [Backport pandas-dev#14498] COMPAT/TST: fix test for range testing of negative integers to neg powers
  [Backport pandas-dev#14476] PERF: performance regression in Series.asof (pandas-dev#14476)
  ...
Labels
Bug IO CSV read_csv, to_csv
4 participants