Thanks for reading. If the table is embedded between text paragraphs, you need to use “area” option to crop the table, as I mentioned in the article, so that you don’t need to deal with the irrelevant text. For non-uniform tables, you might need to define multiple areas in the pdf page then clean the data.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Aaron Zhu

Senior Data Analyst | Always looking for new and exciting ways to turn complex data into actionable insights | https://www.linkedin.com/in/aaron-zhu-53105765/