Thanks for reading. If the table is embedded between text paragraphs, you need to use “area” option to crop the table, as I mentioned in the article, so that you don’t need to deal with the irrelevant text. For non-uniform tables, you might need to define multiple areas in the pdf page then clean the data.

Data Science | Machine Learning | Economics Consulting https://www.linkedin.com/in/aaron-zhu-53105765/