Combine and Extract multiple PDF tables to clean Excel Data using Tabula library of python

Описание к видео Combine and Extract multiple PDF tables to clean Excel Data using Tabula library of python

In this video, we will explore tabula library of Python to combine, convert and extract multiple pdf tables to cleaned excel data ready for further analysis.

We will also use pandas library of python to clean Data and do further data cleaning.

If you have java installed already and still getting an error, then please try below steps, the java setup is bit tricky but hopefully a one time setup.

from windows start option, search for Environment Variables and search for *Edit environment variables*, then follow below steps:

**
Under the System Variables click Path and then press the Edit... instead of New. Then in the next screen (Edit environment variable for the Path variable) click New and add the address, e.g. C:\Program Files (x86)\Java\jre1.8.0_201\bin. Press OK and the Path variable will be appended/updated.**

Answer taken from below:
https://stackoverflow.com/questions/5...

Python Source code:
https://codepad.site/edit/q9aig7rj

Комментарии

Информация по комментариям в разработке