That's my main point as well. Humans are very sound-dependent. In the early days of YouTube, audio quality was tied to video quality, but they found out that people would rather watch low-resolution videos with good sound than vice versa. To be frank, I think the music is too intense.
You should work on the music.
Another problem you might have (now) is visual guidance. If I watch that video on a small 17" screen or in a window, the coherence of the ASCII symbols is a given: I can easily see what you want to display. But on my 24" or 50" screens the individual parts start to lose their (sorry) coherence. Sometimes I have to think for a second or two about what is being shown before me. Working with more levels of green could help.
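
To show roughly what I mean by more levels of green, here is a minimal sketch. I don't know how your renderer works, so everything in it is an assumption: the function names `green_levels` and `colorize` are made up, the brightness is assumed to be a value from 0.0 to 1.0 per character, and the output assumes a terminal that understands 24-bit ANSI colour codes.

```python
def green_levels(n):
    """Return n RGB tuples running from dark green to bright green."""
    if n < 2:
        raise ValueError("need at least two levels")
    # Start at 40 rather than 0 so the darkest level is still visible (arbitrary choice).
    return [(0, round(40 + (255 - 40) * i / (n - 1)), 0) for i in range(n)]


def colorize(char, brightness, palette):
    """Wrap char in a 24-bit ANSI colour escape picked by brightness (0.0 to 1.0)."""
    brightness = min(max(brightness, 0.0), 1.0)  # clamp out-of-range input
    index = min(int(brightness * len(palette)), len(palette) - 1)
    r, g, b = palette[index]
    return f"\x1b[38;2;{r};{g};{b}m{char}\x1b[0m"


if __name__ == "__main__":
    palette = green_levels(8)
    # Print one '#' per level, darkest first.
    print("".join(colorize("#", i / 7, palette) for i in range(8)))
```

With 8 or 16 levels instead of 2 or 3, neighbouring characters that belong to the same shape end up in similar shades, which might make shapes easier to read on a big screen.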