I want to send a pdf file in batches and extract questions from it.
It is a course book.
Problem is now that if I split it into batches of 10 pages or so, that I may split a question in half. Which then leads to a wrongly extracted question or for that matter two questions that are not complete.
Any idea how to resolve it such that the questions that are in two batches are not split?