ConvoKit version 2.3.2
These release notes describe changes made since the v2.3 release, including changes from both v2.3.1 and v2.3.2.
Functionality
Naming changes
- Utterance.root has been renamed to Utterance.conversation_id.
- User has been renamed to Speaker. Functions with 'user' in the name have been renamed accordingly.
- User.name has been renamed to Speaker.id.
(Backwards compatibility will be maintained for all the deprecated attributes and functions.)
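The backwards-compatibility note can be illustrated with a generic deprecated-alias pattern. This is a minimal sketch, not ConvoKit's actual implementation: the class UtteranceSketch, the helper read_old_name, and the warning text are hypothetical, and only the attribute names root and conversation_id come from these notes.

```python
import warnings


class UtteranceSketch:
    """Hypothetical stand-in for an utterance; not ConvoKit's real class."""

    def __init__(self, conversation_id):
        self.conversation_id = conversation_id

    @property
    def root(self):
        # Deprecated alias: warn, then forward to the new attribute.
        warnings.warn(
            "'root' is deprecated; use 'conversation_id' instead",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.conversation_id

    @root.setter
    def root(self, value):
        # Writes through the old name still update the new attribute.
        warnings.warn(
            "'root' is deprecated; use 'conversation_id' instead",
            DeprecationWarning,
            stacklevel=2,
        )
        self.conversation_id = value


def read_old_name(utt):
    """Read the deprecated attribute; return (value, captured warnings)."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        value = utt.root
    return value, caught
```

With this pattern, code written against the old name keeps working while the warning points callers at the new one.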
Corpus
- Corpus now allows users to generate pandas DataFrames for its internal components using get_conversations_dataframe(), get_utterances_dataframe(), and get_speakers_dataframe().
- Conversation objects have a get_chronological_speaker_list() method for getting a chronological list of conversation participants.
- Conversation's print_conversation_structure() method has a new argument, limit, that caps the number of utterances displayed.
Transformers
- New invalid_val argument for HyperConvo that automatically replaces NaN values with the default value specified in invalid_val.
- FightingWords.summarize() now provides labelled plots.
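The NaN replacement that invalid_val describes can be sketched in plain Python. This is an illustration of the idea only, not HyperConvo's code: the function replace_nan, the feature names, and the default of -1.0 are all assumptions made for the example.

```python
import math


def replace_nan(features, invalid_val=-1.0):
    """Return a copy of `features` with NaN values replaced by `invalid_val`.

    `features` is a hypothetical mapping of feature name to numeric value;
    the default of -1.0 is arbitrary and chosen only for this sketch.
    """
    cleaned = {}
    for name, value in features.items():
        # Only floats can be NaN; leave every other value untouched.
        is_nan = isinstance(value, float) and math.isnan(value)
        cleaned[name] = invalid_val if is_nan else value
    return cleaned


# Hypothetical feature values, one of which could not be computed.
raw = {"max_indegree": 3.0, "entropy": float("nan")}
cleaned = replace_nan(raw, invalid_val=-1.0)
```

The input mapping is left unmodified, so the raw values remain available for inspection.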
Bug fixes
- Fixed minor bug in download() when downloading Reddit corpora.
- Fixed bugs in HyperConvo that were causing NaN warnings and incorrect calculation. Fixed minor bug that was causing HyperConvo annotations to not be JSON-serializable.
- Fixed bug in Classifier and BoWClassifier that was causing inconsistent behaviour for compressed vs. uncompressed vector metadata.
Other changes
- Deprecation warnings in ConvoKit have been made more consistent.
- We now have continuous integration for pushes and pull requests! Thanks to @mwilbz for helping set this up.