Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diarization #1556

Merged
merged 15 commits into from
Jul 12, 2018
Merged

Diarization #1556

merged 15 commits into from
Jul 12, 2018

Conversation

happyhuman
Copy link
Contributor

The samples in this code demo diarization: Basically who said what (when there is more than just one person talking).

@googlebot googlebot added the cla: yes This human has signed the Contributor License Agreement. label Jul 10, 2018
@happyhuman happyhuman changed the title Diarization [DO NOT MERGE] Diarization Jul 10, 2018
@happyhuman
Copy link
Contributor Author

@nnegrey , @dizcology , @theacodes Can you please review this PR?

@nnegrey
Copy link
Contributor

nnegrey commented Jul 11, 2018

Looks like these use the speech_v1p1beta1 library, can we put these snippets into the beta_snippets.py file?

python transcribe_diarization.py \
resources/Google_Gnome.wav
python transcribe_diarization.py \
gs://cloud-ml-api-e2e-testing/speech/stereo_audio.wav
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cloud-ml-api-e2e-testing public bucket?

@happyhuman
Copy link
Contributor Author

Good catch @nnegrey. I moved all the 4 beta samples to beta_snippets.py and simplified a few things.

@happyhuman happyhuman requested a review from tswast July 12, 2018 21:10
@happyhuman happyhuman changed the title [DO NOT MERGE] Diarization Diarization Jul 12, 2018
@happyhuman happyhuman merged commit ad4ae03 into master Jul 12, 2018
print('First alternative of result {}: {}'
.format(i, alternative.transcript))
print('Speaker Tag for the first word: {}'
.format(alternative.words[0].speaker_tag))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does each word in alternative.words have other relevant information concerning diarization besides speaker_tag? if so print them as well, if not please ignore this comment.

os.path.join(RESOURCES, 'Google_Gnome.wav'))
out, err = capsys.readouterr()

assert 'OK Google stream stranger things from Netflix to my TV' in out
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are there two different speakers in this audio file? if so assert something about the speaker_tags being returned correctly.

@happyhuman happyhuman deleted the diarization branch July 13, 2018 23:17
busunkim96 pushed a commit to busunkim96/python-speech that referenced this pull request Sep 1, 2020
busunkim96 pushed a commit to googleapis/python-speech that referenced this pull request Sep 3, 2020
telpirion pushed a commit that referenced this pull request Jan 13, 2023
dandhlee pushed a commit that referenced this pull request Feb 9, 2023
telpirion pushed a commit that referenced this pull request Mar 13, 2023
atulep pushed a commit to googleapis/google-cloud-python that referenced this pull request Apr 6, 2023
atulep pushed a commit to googleapis/google-cloud-python that referenced this pull request Apr 6, 2023
atulep pushed a commit to googleapis/google-cloud-python that referenced this pull request Apr 18, 2023
parthea pushed a commit to googleapis/google-cloud-python that referenced this pull request Oct 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cla: yes This human has signed the Contributor License Agreement.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants