You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Have you tried rerunning the command with the --clean flag?
Describe the issue
I encountered an ImportError requiring Chinese tokenization support, which conflicts with numpy and sklearn versions.
I installed MFA via conda, and the version of numpy is 1.26.4.
I tried aligning on a Mandarin corpus, but the terminal prompted that I needed to install dependencies.
ImportError: Please install Chinese tokenization support via pip install spacy-pkuseg dragonmapper hanziconv.
However, spacy-pkuseg requires numpy>=2.0.0. I attempted to run the command provided in the error message, which updated Numpy to 2.0.2. However, if I install Numpy 2.0.2, the dependency installed with MFA is sklearn 1.2.2, and these two packages seem to conflict. I encountered an error:
ValueError: numpy.dtype size changed, may indicate binary incompatibility. Expected 96 from C header, got 88 from PyObject
And I found a solution here: StackOverflow, which suggested downgrading numpy to 1.26.4.
I ultimately found a temporary solution by specifying an older version of spacy-pkuseg:
pip install spacy-pkuseg==0.0.33
This version only need numpy>=1.19.0.
I hope MFA can resolve this dependency issue and update the documentation, as there are no instructions indicating that I need to install these dependencies, but I receive a prompt when executing mfa align ....
For Reproducing your issue
Corpus structure
What language is the corpus in? Mandarin
How many files/speakers? 1 speaker, 1 audio and 1 text. Just for test.
Are you using lab files or TextGrid files for input? No.
Dictionary
Are you using a dictionary from MFA? If so, which one? Yes. mandarin_china_mfa
If it's a custom dictionary, what is the phoneset?
Acoustic model
If you're using an acoustic model, is it one download through MFA? If so, which one? Yes. mandarin_mfa
If it's a model you've trained, what data was it trained on?
Debugging checklist
mfa version
?--clean
flag?Describe the issue
I encountered an ImportError requiring Chinese tokenization support, which conflicts with numpy and sklearn versions.
I installed MFA via conda, and the version of numpy is 1.26.4.
I tried aligning on a Mandarin corpus, but the terminal prompted that I needed to install dependencies.
However, spacy-pkuseg requires
numpy>=2.0.0
. I attempted to run the command provided in the error message, which updated Numpy to 2.0.2. However, if I install Numpy 2.0.2, the dependency installed with MFA is sklearn 1.2.2, and these two packages seem to conflict. I encountered an error:And I found a solution here: StackOverflow, which suggested downgrading numpy to 1.26.4.
I ultimately found a temporary solution by specifying an older version of spacy-pkuseg:
This version only need
numpy>=1.19.0
.I hope MFA can resolve this dependency issue and update the documentation, as there are no instructions indicating that I need to install these dependencies, but I receive a prompt when executing
mfa align ...
.For Reproducing your issue
mandarin_china_mfa
mandarin_mfa
To reproduce:
Log file
sp1.log
Desktop (please complete the following information):
Additional context
The text was updated successfully, but these errors were encountered: