I think you should try to fill the frame more, I am not sure what the center for attention is here.
What I do is try to decide what is interesting in the scene, then get as close as I can read:(reasonably long lens, not to interfere)
The guy holding the flag, seems like he could be interesting.
I agree with KjetilNorway – I don’t know what the focus of attention is here.
I think it is the guy holding the sign but I am not sure. Too many other things vying my attention. Shallower DOF may remove some of the interest from the guy in the foreground left.
I think I might know what you are trying to do, which is take an image to show the attention paid by people pay to buskers??? Well, compostion is ok, a certain balance exists, but nothing really comes together in this one. If I may offer an opinion: stay away from buskers and street performers overall, they are too easy and are generally dull photographically. But, saying that, they can be made to work, I have seen others do it. Just beats me.