Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Android run demo by small model #518

Open
qcxu2 opened this issue Feb 21, 2023 · 4 comments
Open

Android run demo by small model #518

qcxu2 opened this issue Feb 21, 2023 · 4 comments
Labels
performance CPU and memory usage - results and comparisons

Comments

@qcxu2
Copy link

qcxu2 commented Feb 21, 2023

I run the demo on a real Android machine and copied the samll model, but the conversion time is too long. Is it a machine problem?

@ron-diesel
Copy link

I have the same problem, Samsung s10e exynos, base data model

@tinoue
Copy link
Contributor

tinoue commented Feb 27, 2023

Make sure to use release build and select tiny or base model.
On my Redmi note 9s (Cortex-A75), 48sec of voice sample can be transcribed in 37sec with base model.

@ggerganov ggerganov added the performance CPU and memory usage - results and comparisons label Feb 27, 2023
@ggerganov
Copy link
Owner

In addition to @tinoue comment, in the future you might be able to use quantised models for better performance on mobile devices - follow #540 for more informatiom

@VipinVIP
Copy link

VipinVIP commented May 7, 2023

Make sure to use release build and select tiny or base model.

This. There were a huge difference in audio conversion times between a debug apk and a release one.

Model Variant Input audio time Time for ranscribing
Tiny ( ggml-tiny.en.bin ) Debug 11001 ms (11 sec) 37519 ms (37.51 sec)
Release 11001 ms (11 sec) 5123 ms (5.12 sec )

Using a quantized model had almost no conversion time difference , but a major advantage in apk size , without compromising in Accuracy (in my observation)

Model Variant Input audio time Time for ranscribing
Tiny Q4_2 ( ggml-tiny.en-q4_2.bin ) Debug 11001 ms (11 sec) 46517 ms (46.51 sec)
Release 11001 ms (11 sec) 5309 ms (5.3 sec )

Also , clicking the benchmark button in android crashes the app. Don't know the reason since im not much familiar with android.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance CPU and memory usage - results and comparisons
Projects
None yet
Development

No branches or pull requests

5 participants