Skip to content

ConvoKit version 2.3.2

Compare
Choose a tag to compare
@calebchiam calebchiam released this 03 Jun 07:18
· 556 commits to master since this release

This release describes changes that have happened since the v2.3 release, and includes changes from both v2.3.1 and v2.3.2.

Functionality

Naming changes

  • Utterance.root has been renamed to Utterance.conversation_id
  • User has been renamed to Speaker. Functions with 'user' in the name have been renamed accordingly
  • User.name has been renamed to Speaker.id

(Backwards compatibility will be maintained for all the deprecated attributes and functions.)

Corpus

  • Corpus now allows users to generate pandas DataFrames for its internal components using get_conversations_dataframe(), get_utterances_dataframe(), and get_speakers_dataframe().
  • Conversation objects have a get_chronological_speaker_list() method for getting a chronological list of conversation participants
  • Conversation's print_conversation_structure() method has a new argument limit for limiting the number of utterances displayed to the number specified in limit.

Transformers

  • New invalid_val argument for HyperConvo that automatically replaces NaN values with the default value specified in invalid_val.
  • FightingWords.summarize() now provides labelled plots

Bug fixes

  • Fixed minor bug in download() when downloading Reddit corpora.
  • Fixed bugs in HyperConvo that were causing NaN warnings and incorrect calculation. Fixed minor bug that was causing HyperConvo annotations to not be JSON-serializable.
  • Fixed bug in Classifier and BoWClassifier that was causing inconsistent behaviour for compressed vs. uncompressed vector metadata

Other changes

  • Warnings in ConvoKit for deprecation have been made more consistent.
  • We now have continuous integration for pushes and pull requests! Thanks to @mwilbz for helping set this up.