Moving to a smaller production size does not improve efficiency...it simply allows for a bit higher clockability and less power consumption.
DDR2 won't show any performance increases.
"Improving the architecture's efficiency." That is a result, not an action. You need to explain how they are accomplishing this, or else its nothing.
The additional complex instruction decoder may indeed help, but not enough.
Reverse HyperThreading - Um...maybe not, lol.
I think that the maximum performance increase possible from Conroe is going to be 20%. No more, it just won't happen. Not from an architectural revision.
As for the Silicon-Germanium transistors...where did you get that? I know those are the same ones used in that IBM chip, but where did you get that they're going to be using those in K8L?