v25.12.33 by ROBERT-MCDOWELL · Pull Request #1508 · DrewThomasson/ebook2audiobook

ROBERT-MCDOWELL · 2026-01-10T02:51:16Z

No description provided.

Copilot

Pull request overview

This pull request (v25.12.33) introduces significant refactoring to the TTS (Text-to-Speech) engine architecture and audio processing pipeline. The changes focus on improving GPU/device handling, VTT subtitle generation, and text processing logic.

Changes:

Refactored GPU policy handling to support multiple device types (CUDA, ROCm, MPS, XPU) with improved AMP dtype selection
Moved VTT subtitle generation from inline processing to a deferred batch operation using audio file analysis
Extracted common TTS engine methods (_set_voice, _convert_sml) to a shared utility base class
Added text merging logic for very short rows in join_ideogramms function
Fixed template parameter name in time format string substitution

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 19 comments.

Show a summary per file

File	Description
lib/gradio.py	Refactored event handler to chain voice list updates; simplified return values for change_gr_fine_tuned_list
lib/core.py	Added text merging pass, fixed time format parameter, refactored convert_chapters2audio to track sentences separately, extracted get_audio_duration function
lib/classes/tts_manager.py	Added create_sentences2vtt method to support deferred VTT generation
lib/classes/tts_engines/*.py	Removed per-sentence VTT tracking, added torch.autocast for GPU acceleration, refactored to use common _set_voice/_convert_sml methods
lib/classes/tts_engines/common/utils.py	Replaced _apply_cuda_policy with comprehensive _apply_gpu_policy supporting multiple devices; added _build_vtt_file for batch VTT generation; extracted _set_voice and _convert_sml methods

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lib/classes/tts_engines/common/utils.py

lib/core.py

lib/classes/tts_engines/xtts.py

lib/classes/tts_engines/tacotron.py

lib/core.py

lib/classes/tts_engines/yourtts.py

lib/classes/tts_engines/vits.py

lib/classes/tts_engines/bark.py

ROBERT-MCDOWELL added 12 commits January 6, 2026 20:06

...

2483f3a

...

b67cfea

...

0765722

...

c45f634

...

25591da

...

e36e0d0

...

16a3ab4

...

7367683

...

982d309

...

dd5c6a5

...

297c676

...

4baafab

Copilot AI review requested due to automatic review settings January 10, 2026 02:51

ROBERT-MCDOWELL merged commit f309ec6 into DrewThomasson:v25 Jan 10, 2026
2 checks passed

Copilot started reviewing on behalf of ROBERT-MCDOWELL January 10, 2026 02:51 View session

Copilot AI reviewed Jan 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v25.12.33#1508

v25.12.33#1508
ROBERT-MCDOWELL merged 12 commits intoDrewThomasson:v25from
ROBERT-MCDOWELL:v25

ROBERT-MCDOWELL commented Jan 10, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ROBERT-MCDOWELL commented Jan 10, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants