A Python script that converts PDF files to CSV format using the Tabula library.
- Python 3
- Tabula library
- Pandas library
- Clone this repository:
- Install the Tabula and Pandas libraries:
- pip install tabula pandas
- Put your PDF files in the
pdf
directory. - Run the script with the following command:
- python convert.py
- The CSV files will be generated in the
csv
directory.
-f
or--file
: specify a single file to convert (e.g.--file example.pdf
).-p
or--pages
: specify which pages to convert (e.g.--pages 1-3,5
).-s
or--stream
: stream output to stdout instead of writing to file.-d
or--debug
: enable debug mode.
Contributions are welcome! If you would like to contribute to this project, please follow these steps:
- Fork this repository.
- Create a new branch:
git checkout -b feature-branch
- Make your changes and commit them:
git commit -m "Add some feature"
- Push to the branch:
git push origin feature-branch
- Submit a pull request.
If you have any questions or comments, please contact me at montenegrodiussantos1@gmail.com