Convert PDF into an audiobook.
-
Updated
Nov 9, 2020 - Python
Convert PDF into an audiobook.
Extracting details from Resume(CVs) and matching with Job Description(JDs) using pretrained model like DistilBERT and ranking them using cosine similarity.
This project facilitates the extraction of text from PDF files using various Python libraries. It is designed to be flexible, allowing the choice among different text extraction libraries and supporting both single PDF file and directory containing multiple PDF files.
NLP model for extracting chinese datas from the documents
This is my exploration of a variety of Python 🐍 libraries. I have built geospatial data analytics systems from CSV files, Image and video processing tools like face detection and motion detection. I also built a website with flask (and three.js), I built apps connecting to several types of databases. Created a simple budgeting app that reads, wr…
Interface developed to extract information from web through scraping and summarize given data.
A modern web application that integrates a conversational AI chatbot with real-time user interactions, including file uploads and smooth animations. Built using React, Framer Motion, Lucide Icons, and ShadCN Components on frontend and fastapi on backend.
Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention
Scrapes data tables from a PDF file.
A sample script to extract text data from a pdf file, converts it to a pandas data frame, and saves it into a CSV file.
PDF to MP3, audio book convertor
collecting data from the Barcelona City Hall Open Data Service's on socioeconomic indicators of the territorial division of the city of Barcelona
Программа парсит несколько pdf-отчетов, ищет необходимую информацию о серии и флаконах, формирует отчет и создает excel-файл с отчетом.
Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.
Ready to use Python application/file for parsing a specific format of pdf form, and storing relevant user data in a tabular format in excel sheet
This repository contains a Python script for comparing PDF files between a local source folder and a remote server. The script logs results, highlighting identical and non-identical files based on size and page count. It employs "pdfplumber" for PDF handling and "paramiko" for SSH connections.
Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention
NLP-based resume parsing tool for extracting relevant info and rank candidates for job applications
Add a description, image, and links to the pdfplumber topic page so that developers can more easily learn about it.
To associate your repository with the pdfplumber topic, visit your repo's landing page and select "manage topics."