I'm excited to announce the first release of Local-NotebookLM, a powerful AI-driven tool that transforms PDF documents into engaging podcasts using local LLMs and TTS models! ๐
What's included in this release: โจ
- **Complete PDF processing pipeline** ๐ - extract text from PDFs and convert them into conversational content
- **Flexible model support** ๐ค - works with OpenAI, Groq, Azure, LMStudio, Ollama, and custom endpoints
- **Multiple output formats** ๐๏ธ - generate summaries, podcasts, articles, or interviews
- **Customizable styles and lengths** ๐ - from short, casual summaries to in-depth technical discussions
- **Text-to-Speech integration** ๐ - convert transcripts to natural-sounding audio with multiple voices
- **Command-line interface** ๐ป - easy to use with comprehensive options
- **Programmatic API** ๐งฉ - integrate into your own Python projects
- **FastAPI server** ๐ - access functionality through a web API
Getting started: ๐
bash
pip install Local-NotebookLM
python -m local_notebooklm.start --pdf my_document.pdf
Why use Local-NotebookLM? ๐ค
This tool bridges the gap between static PDFs and dynamic audio content, making information more accessible and engaging. Perfect for researchers, content creators, educators, or anyone who wants to consume PDF content in a more accessible format.
What's next: ๐ฎ
I'm planning future releases to improve audio quality, add more customization options, and expand model compatibility. Your feedback is essential for guiding my development priorities!
---
Check out the example podcast in the repository to hear what Local-NotebookLM can produce. For detailed documentation, configuration options, and usage examples, please refer to the README.md.
Best, Gรถkdeniz Gรผlmez ๐