Thanks to Haacked for posting this one. I haven't yet read it fully, but this IEEE paper shows good results when doing test-first development vs. other methods.
"We found that test-first students on average wrote more tests and, in turn, students who wrote more tests tended to be more productive. We also observed that the minimum quality increased linearly with the number of programmer tests, independent of the development strategy employed."
"The external validity of the results could be limited since the subjects were students. Runeson [21] compared freshmen, graduate, and professional developers and concluded that similar improvement trends persisted among the three groups. Replicated experiments by Porter
and Votta [22] and Höst et al. [23] suggest that students may provide an adequate model of the professional population."
To read a good summary of that paper go to Phil's post and read it to the end.
Update: Jacob rips the report to shreds.