My idea is to write a report in which each word acts as a button: when I press a word, a specific sound is played.
You will need to put a collider over each word and then use OnMouseDown to trigger the sound. There is one potential problem: if the text gets reformatted depending on the device (mobile, desktop, etc.), the colliders may no longer line up with the words. Making each word its own GameObject might work, but it's a lot of work…
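As a rough illustration of the collider approach, here is a minimal sketch. It assumes each word is its own GameObject with a BoxCollider2D sized to cover the word, and that the project uses the built-in Input Manager (OnMouseDown is not called when only the new Input System package is active). The class name WordSoundButton and the field name clip are made up for this example.

```csharp
using UnityEngine;

// Hypothetical component: attach one to each word's GameObject.
// RequireComponent adds the collider and audio source if they are missing.
[RequireComponent(typeof(BoxCollider2D))]
[RequireComponent(typeof(AudioSource))]
public class WordSoundButton : MonoBehaviour
{
    // Sound for this word, assigned in the Inspector (assumed setup).
    [SerializeField] private AudioClip clip;

    private AudioSource source;

    private void Awake()
    {
        source = GetComponent<AudioSource>();
    }

    // Unity calls this when the pointer is pressed over this object's collider.
    private void OnMouseDown()
    {
        if (clip != null)
        {
            source.PlayOneShot(clip);
        }
    }
}
```

Because each word carries its own collider, the click area moves with the word if the layout changes, but the collider size still has to match the rendered word on every device.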