I think the true problem lies not really in the fact that it SOUNDS like a robot, but that it talks like a mindless drone. It doesn't understand the sentence structure, tone, tempo, the point of the sentence, it doesn't know where to put emphasis, etc. That is what makes comprehension difficult.
It would have to be a very sophisticated program that could accurately provide these functions.