I’ve been messing around with terrains for awhile and for the most part I’ve used 3rd person cameras for this effect but I’ve never been able to get it exactly right. I’m sure the majority of us have played FFVII and IX (I didn’t like VIII) and the one thing you notice is that the person(player) is quite small on the map, so I don’t know if the camera is just zoomed out in third person or set at a perspective?
The image below illustrates what I’m trying to do, the only difference is that my player will be a sprite instead of a model since the rest of the game is in 2D with some 3D scenes.
You could simply add position offsets to the script that you have managing your camera, and from there could dynamicly change how you position the camera behind the player - allowing you to pull the camera back quite a distance, in similar fashion to what you have shown in your post.