PdfS is a powerful PDF processing application that allows you to summarize, query, and listen to PDF documents using state-of-the-art natural language processing models.
- PDF Summarization: Generate concise summaries of your PDF documents
- Question Answering: Ask questions about your PDF content and get precise answers
- Text-to-Speech: Listen to your PDF documents with customizable voice options
- Page Range Selection: Process specific pages of your PDF documents
- Download the zip file or Clone the repository
git clone https://github.com/pratham-ak2004/PdfS.git
cd PdfS- Install the required dependencies
pip install -r requirements.txt- Run the application
python main.py-
Open the provided URL in your browser (typically http://127.0.0.1:7860)
-
Upload your PDF document using the document viewer
-
Use the available functions:
- Set the page range (optional)
- Click the "Summarize" button
- View the generated summary in the text area
- Type your question in the text area
- Click "Ask Question"
- View the answer as a notification and in the text area
- Select your preferred voice from the dropdown menu
- Click "Read" to generate an audio file
- Use the audio player to listen to the PDF content
- Start Page/End Page: Set specific page ranges to process
- Max Summarization Tokens: Limit the amount of text processed for summarization
- Max/Min Length: Control the length of the generated summary
- Voice Selection: Choose from available system voices for text-to-speech
- Summarization:
pszemraj/led-base-book-summary - Question Answering:
deepset/roberta-base-squad2