Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

πŸ“š [Contribution] ebook2audiobook roadmap #32

Closed
55 of 73 tasks
DrewThomasson opened this issue Oct 11, 2024 · 11 comments
Closed
55 of 73 tasks

πŸ“š [Contribution] ebook2audiobook roadmap #32

DrewThomasson opened this issue Oct 11, 2024 · 11 comments
Assignees
Labels
Docs Improvements or additions to documentation Help wanted Extra attention is needed Open to public contributions open to public contributions TODOs

Comments

@DrewThomasson
Copy link
Owner

DrewThomasson commented Oct 11, 2024

All Features open to public Contributions ⭐

Wanted Extra Parameters

My Other Repos I Want to Integrate into the App for Extra Options :)

Create a standard function for load_model() and inference_model() for:

  • ⓍXTTSv2
  • Styletts2
  • πŸͺˆ Piper-tts
  • 🐢 Bark tts
  • Fairseq
  • Vits
  • YourTTS
  • Zonos
  • F5-TTS

Create Readme in these languages

  • Arabic (ara)
  • Chinese (zho)
  • English (eng)
  • Spanish (spa)
  • French (fra)
  • German (deu)
  • Italian (ita)
  • Portuguese (por)
  • Polish (pol)
  • Turkish (tur)
  • Russian (rus)
  • Dutch (nld)
  • Czech (ces)
  • Japanese (jpn)
  • Hindi (hin)
  • Bengali (ben)
  • Hungarian (hun)
  • Korean (kor)
  • Vietnamese (vie)
  • Swedish (swe)
  • Persian (fas)
  • Yoruba (yor)
  • Swahili (swa)
  • Indonesian (ind)
  • Slovak (slk)
  • Croatian (hrv)

Binary builds Working pyinstaller script for:

  • 🍎 Mac Intel x86
  • πŸͺŸ Windows x86
  • 🐧 Linux x86
  • πŸ–₯️🍏 Apple Silicon Mac
  • πŸͺŸπŸ’ͺ ARM Windows
  • 🐧πŸ’ͺ ARM Linux

🐍 Single pip command install that works for:

  • being overseen by @ROBERT-MCDOWELL
  • 🍎 Mac Intel x86
  • πŸͺŸ Windows x86
  • 🐧 Linux x86
  • πŸ–₯️🍏 Apple Silicon Mac
  • πŸͺŸπŸ’ͺ ARM Windows
  • 🐧πŸ’ͺ ARM Linux

Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command)

For higher level developers:

  • Integrate VoxNovel experimental functionality into this 🀷 eventually. . .

Wanted Auto-testing scripts for development

@DrewThomasson if you want to help out at all! πŸ˜ƒ

@DrewThomasson DrewThomasson added the Docs Improvements or additions to documentation label Oct 11, 2024
@DrewThomasson DrewThomasson pinned this issue Oct 11, 2024
@DrewThomasson DrewThomasson changed the title πŸ“š ebook2audiobookxtts roadmap πŸ“š [Want to contribute?] ebook2audiobookxtts roadmap Oct 12, 2024
@DrewThomasson DrewThomasson added Help wanted Extra attention is needed Open to public contributions open to public contributions TODOs and removed Docs Improvements or additions to documentation labels Oct 13, 2024
@DrewThomasson DrewThomasson added the Docs Improvements or additions to documentation label Oct 14, 2024
@ROBERT-MCDOWELL
Copy link
Collaborator

ROBERT-MCDOWELL commented Oct 15, 2024

Another interesting option would be to change voices between chapters. for i.e.:
--voice_mapping {"chapters": {1:"john.wav",2:"stella.wav",3:"child.wav",4:"random"} }
so the selected chapters will see their voice mapped, others will keep the main --voice intact.

@DrewThomasson
Copy link
Owner Author

Another interesting option would be to change voices between chapters. for i.e.: --voice_mapping {"chapters": {1:"john.wav",2:"stella.wav",3:"child.wav",4:"random"} } so the selected chapters will see their voice mapped, others will keep the main --voice intact.

@ROBERT-MCDOWELL Added to roadmap checklist

@DrewThomasson
Copy link
Owner Author

Compare tts

#35 (comment)

@DrewThomasson
Copy link
Owner Author

Translate ebook to X language
#35 (comment)

@geneliu
Copy link

geneliu commented Dec 31, 2024

Integration with https://github.com/janeczku/calibre-web

@ROBERT-MCDOWELL
Copy link
Collaborator

@geneliu how you would see it?

@DrewThomasson
Copy link
Owner Author

DrewThomasson commented Jan 2, 2025

flask api convo #179 (comment)

@ROBERT-MCDOWELL ROBERT-MCDOWELL unpinned this issue Jan 8, 2025
@ROBERT-MCDOWELL ROBERT-MCDOWELL changed the title πŸ“š [Want to contribute?] ebook2audiobookxtts roadmap πŸ“š [ontribution] ebook2audiobookxtts roadmap Feb 18, 2025
@ROBERT-MCDOWELL ROBERT-MCDOWELL changed the title πŸ“š [ontribution] ebook2audiobookxtts roadmap πŸ“š [Contribution] ebook2audiobookxtts roadmap Feb 18, 2025
@ROBERT-MCDOWELL ROBERT-MCDOWELL pinned this issue Feb 18, 2025
@ROBERT-MCDOWELL ROBERT-MCDOWELL changed the title πŸ“š [Contribution] ebook2audiobookxtts roadmap πŸ“š [Contribution] ebook2audiobook roadmap Feb 21, 2025
ROBERT-MCDOWELL added a commit that referenced this issue Feb 23, 2025
ROBERT-MCDOWELL added a commit to ROBERT-MCDOWELL/ebook2audiobook that referenced this issue Feb 23, 2025
ROBERT-MCDOWELL added a commit that referenced this issue Feb 23, 2025
@DrewThomasson
Copy link
Owner Author

More tts models I will look at figuring out how to add to coqui tts (hopefully if I can figure it out) as well as my ultimate goal of making a PR to coqui to add styletts2

https://github.com/karim23657/awesome-Persian-Speech?tab=readme-ov-file

@DrewThomasson
Copy link
Owner Author

Potentially adding epub3 as a output format

Example being storyteller

https://www.reddit.com/r/Python/s/vt8DsiogW8

@ROBERT-MCDOWELL
Copy link
Collaborator

Potentially adding epub3 as a output format

Example being storyteller

https://www.reddit.com/r/Python/s/vt8DsiogW8

we can do it already, it needs just to specify epub3 as output extension at the conversion.

@ROBERT-MCDOWELL
Copy link
Collaborator

correction, It does not need to add the extension, just add some options... ready for next PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Docs Improvements or additions to documentation Help wanted Extra attention is needed Open to public contributions open to public contributions TODOs
Projects
None yet
Development

No branches or pull requests

3 participants