🚀 New Features
- **Video Analysis**: Perform frame-by-frame analysis of video content using advanced OpenAI and OpenRouter Vision models.
- **Audio Transcription**: Integrated Whisper model for precise audio-to-text transcription.
- **Dynamic Frame Selection**: Automated selection of relevant frames, optimizing processing and providing focused insights.
- **Scene Change Detection**: Identify transitions and scene changes to support context-aware analysis.
- **AI-Powered Summarization**: Generate cohesive summaries that combine video and audio elements for a comprehensive overview.
- **Customizable Prompts and Models**: Define custom prompts and configure model preferences to tailor analysis to specific project needs.
- **Metadata Extraction**: Collect valuable metadata, enabling deeper insights and facilitating data-driven applications.
🔑 Improvements
- **API Key Management**: Support for setting OpenAI and OpenRouter API keys via environment variables for seamless integration.
- **Logging Enhancements**: Configurable logging options for easier debugging and monitoring of analysis processes.
📦 Installation
- **Python 3.10+ Compatibility**: Optimized for Python 3.10 and above, ensuring compatibility with the latest language features.
- **FFmpeg Integration**: Prerequisite FFmpeg setup for smooth video handling and frame processing.
📝 Documentation
- **Detailed Usage Examples**: Comprehensive guides for quick-start and advanced configurations.
- **Prompt Configuration Guide**: Tips and examples on crafting effective prompts for optimized analysis.
- **Application Examples**: Use cases in content moderation, media, education, and accessibility.
🚀 Future Enhancements (Planned)
- **Real-Time Video Analysis**: Upcoming feature for real-time processing of live video streams.
- **Multi-Language Support**: Expansion of transcription capabilities to support additional languages.
- **Enhanced Metadata Extraction**: Upcoming improvements for key phrase tagging and enriched metadata insights.
**OpenSceneSense** is now live and ready to empower developers and researchers with the tools to unlock advanced video insights.