Visualizing audio is a challenging problem. Currently, the only ways to learn speech intonation are to listen to audio and try to imitate it, or to practice by chatting with a conversational agent. However, if raw speech audio could be visualized intuitively, learners could compare the audio with its visual representation and thereby better understand the structure of the speech.