I also wrote a similar timing test program for creating objects using reflection. The differences for this case aren't nearly as significant as for the field and method call cases, though.
In that article, we just passed the class and method name as command-line parameters to my program, which loaded the binary class file, added the timing code, then wrote the class back out.
When the application makes an intensive use of timing functions, a more efficient implementation of the timing routines can improve the overall performance of the program.