Welcome to the forum.
If I understood correctly, you are supplying the document as an image and you want the model to answer with a precise bounding box for the text (the model properly identifies and returns the text, but the bounding box it returns is not precise enough). If that is the case, then also check for any answers to this post - Bounding Box Alignment Problems and Image Rescaling
Hope that helps.