Converting Asm to C: Metrics?

G'day All,

I'm trying to estimate the effort involved in a project converting some well-structured and commented (not mine, needless to say) 8051 assembler to C. As a first estimate, naturally, I'm performing the task on a small part of the code and extrapolating. Does anyone have (or know of) any metrics from a similar project, as a sanity check? It doesn't need to be 8051 to C; any non-automated assembler-to-HLL conversion stats would be useful.
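
For what it's worth, the extrapolation itself is just a rate calculation; here's a trivial sketch in C with purely made-up placeholder numbers (none of these are real figures from our code base):

    #include <stdio.h>

    int main(void)
    {
        /* Placeholder numbers only -- substitute your own measurements. */
        double sample_ksloc = 2.0;   /* size of the converted sample, in KSLOC */
        double sample_hours = 45.0;  /* effort the sample conversion took */
        double total_ksloc  = 30.0;  /* size of the full asm code base */

        double rate = sample_hours / sample_ksloc;      /* hours per KSLOC */
        printf("rate: %.1f h/KSLOC, extrapolated total: %.0f hours\n",
               rate, rate * total_ksloc);
        return 0;
    }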

Thanks, Alf

Reply to
Alf Katz

Hello Alf,

One man's "well documented code" is another man's "just spaghetti code".

Whenever I am asked to convert assembly to C, I bid for a redesign with documentation up front.

The way most assembly for micros is written is horrid at best.

Then.....

"But we just want to add a little extra code to ...."

This is where it will fall on its face.

Good Luck, you will need it.

donald

Reply to
Donald

I've done this sort of thing many times. I usually wind up deriving a functional spec, and maybe an architecture, from the existing assembler and re-writing clean code in C.

Even with ideal assembler, the idioms are different. With less than ideal assembler, the idioms are just plain wrong ;). I can't give you metrics; some things are hard, some things are easy.
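
To make the idiom gap concrete, here's a made-up example (not from any real code base): the asm clears a buffer with the classic DJNZ count-down loop and flags completion with a bit in bit-addressable RAM, while the idiomatic C rewrite just states the intent.

    /* Hypothetical 8051 asm being replaced (shown as a comment):
     *
     *          MOV  R0, #buf       ; pointer in R0
     *          MOV  R7, #16        ; count in R7
     *   loop:  MOV  @R0, #0
     *          INC  R0
     *          DJNZ R7, loop       ; decrement-and-branch idiom
     *          SETB buf_empty      ; flag lives in bit-addressable RAM
     *
     * A word-for-word translation would keep the count-down and the
     * pointer juggling; the idiomatic C version says what was meant:
     */
    #include <string.h>

    #define BUF_LEN 16

    static unsigned char buf[BUF_LEN];
    static unsigned char buf_empty;   /* was a bit flag in bit-addressable RAM */

    static void clear_buf(void)
    {
        memset(buf, 0, sizeof buf);
        buf_empty = 1;
    }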

(FWIW, I often spot a bunch of bugs in the assembler in doing such conversion exercises. But I can generally extract what the coder *intended* to do, rather than what actually happens. IOW, it's not a bad way of doing a code review and a sanity check.)

Steve


Reply to
Steve at fivetrees

This can be very open-ended ( but you probably already know that ... )

Is this code moving to another platform, or staying on the C51 and getting a C makeover, to make new features easier to control?

If it is staying on the C51, you can quote in stages, and move the proven/stable asm to libraries where C can call it (i.e. don't rewrite what you don't have to...).
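
As a rough sketch of what that boundary can look like (all names hypothetical; the register conventions mentioned are Keil C51's, so check your own compiler's manual):

    /* New C code calling a proven asm routine kept in a library.
     * With Keil C51, the asm module would export the underscore-prefixed
     * symbol _crc8_update, take its char arguments in R7 and R5, and
     * return the result in R7 (assumption: Keil conventions; verify for
     * your toolchain). */

    unsigned char crc8_update(unsigned char crc, unsigned char c); /* in crc8.a51 */

    unsigned char frame_crc(const unsigned char *buf, unsigned char len)
    {
        unsigned char crc = 0;
        unsigned char i;

        for (i = 0; i < len; i++)
            crc = crc8_update(crc, buf[i]);  /* C calls straight into the old asm */
        return crc;
    }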

You will also be able to have testable sign-off stages, as this step should be an operational clone of the existing system.

Then, you can add all the new features in C...

-jg

Reply to
Jim Granville

Hello ,

You can collect the following metrics:

a) Compare the size of the code generated from the C with the size of the existing ASM code. Set a limit: say, the code generated from the C should be less than 1.5 to 2 times the size of the ASM code. (For example, if the current ASM image is 40 KB, a 1.5x ceiling means rejecting a C build much over 60 KB.)

b) Do a code coverage and profiling analysis for both the ASM and the C code.

May I know the exact reason for this conversion?

Best Regards, Vivekanandan M

Reply to
Vivekanandan M

I have found the metrics interesting in the cases where we have done this. Evolved code is rarely that clean, and compilers generally do a considerably better job at local variable allocation and placement than assembler programmers. Don't be surprised if the code generated from C results in shorter, faster applications with lower RAM requirements.

Of course I am biased.

w..

Reply to
Walter Banks

Fat chance

Maybe on a C-friendly machine, but not much chance on an 8051.

I'd be amazed if it does.

Agreed.

Reply to
cbarn24050

Was the assembly code originally written by someone who is also a skilled C programmer?

I'd guess somewhat longer than it would take to write the C program from scratch (given good documentation), because you have to extract the design, evaluate it, probably modify aspects of it, and then implement it.

Best regards, Spehro Pefhany

--
"it's the network..."                          "The Journey is the reward"
speff@interlog.com             Info for manufacturers: http://www.trexon.com
Embedded software/hardware/analog  Info for designers:  http://www.speff.com
Reply to
Spehro Pefhany

hehe, yes you are.

I've never had the experience of a C compiler reducing my code and data footprint or improving on the execution time. But then, I've only had one truly comparable experience, where I was paid to port an assembly-coded, full-up application that I'd also written, as exactly as I could manage, into C. The size expanded dramatically on all points, and I was seriously trying to write good, maintainable C and accurately reflect the details of operation. I'm not ignorant of library issues, nor of numerical methods, and I feel I applied a good degree of expertise in writing the C code.

Even in the case of small routines, where I'm forced to apply the model required by C for interfacing purposes (frames, stack-unwind support if appropriate, register preservation, etc.), I have yet to find any C compiler able to improve on execution -or- space.

Jon

Reply to
Jonathan Kirwan

Start from the right end by defining what the product does (I know this is obvious :-), perhaps as some sort of functional spec. Then map the existing code: the major functional blocks, the relationships between modules in terms of how data gets passed around the system, how many code banks are involved, and the relationships between const data, code and common-area dependencies, etc. If the code is well documented already, you have a head start, but the docs and code comments may not match current reality. You could spend weeks or even months trying to work out what the old code is doing. Then you need to check for bugs and subtle 'intended' side effects, etc. You have to assume that all the old code is suspect.

One major obstacle to analysing older systems is that there can be global data all over the place, and if the code has been heavily modified over the years, it can be a nightmare to determine which global gets used where and when. You will also have to modify, or write wrappers for, the asm modules that you decide to keep, to conform to the calling conventions of the C compiler.
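
For instance, a wrapper for a kept asm module might look like this (all names invented for illustration): the legacy routine communicates through globals, so the wrapper marshals arguments in and results out, giving the new C code a clean interface.

    /* Hypothetical legacy interface: the old asm reads/writes fixed globals. */
    extern volatile unsigned char adc_channel;  /* global the asm reads */
    extern volatile unsigned char adc_result;   /* global the asm writes */
    extern void adc_read_raw(void);             /* proven asm routine, unchanged */

    /* Clean C-facing wrapper around the unmodified assembler. */
    unsigned char adc_read(unsigned char channel)
    {
        adc_channel = channel;  /* marshal the argument into the expected global */
        adc_read_raw();         /* run the legacy code */
        return adc_result;      /* marshal the result back out */
    }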

IME, reuse of a worn-out code base ends up as a dog's dinner. It's usually quicker to rewrite the whole lot: sharp pencil, nice new clean code base, well structured, etc. You just have to convince the management :-)...

Chris

--
Greenfield Designs Ltd
-----------------------------------------------------------
Embedded Systems & Electronics: Research Design Development
Oxford. England. (44) 1865 750 681

Reply to
Chris Quayle

Completely agree. Nicely put.

But - the old codebase is useful in terms of analysing:
- What was intended
- The workarounds employed to get it to work as intended

Steve


Reply to
Steve at fivetrees

The old code is the only thing that is not suspect; it's everything else that's suspect (comments, low-level requirements, customer spec, tests, etc.). The only thing you can be sure accurately reflects the desired product performance and purpose is the old program code: it's the only thing that was guaranteed to be maintained, however poor the implementation.

Reply to
steve

Yepp, you're right Steve (not at fivetrees). In this case the code is the spec. It has been debugged over 10 years, is very stable, and is the basis of the company's existence. The code in fact defines what the product does; the code is the documentation, although the original coder's brain is available for consultation. The way to work out what the code is doing is to rewrite it in something intelligible, and we may as well make that the original version of the next-generation product.

Cheers, Alf

Reply to
Alf Katz

We have a product that has outgrown the 8051. Even the 33MHz Dallas single-cycle 8051s are no longer fast enough to do the job. Both program and data requirements have outgrown the current 512kB X/P memory-mapping scheme. I am examining and comparing two major solutions. One is building a better 8051 inside an FPGA, with a 3:1 improvement in speed and heaps of hardware speed-ups to critical tasks. The other is migrating to a faster processor (e.g. an ARM, but the actual processor is pretty irrelevant once we get to C). The latter has numerous other advantages, not least of which is the maintainability and expandability promised by the conversion to C.

The reason I was interested in metrics for the conversion process is that the major difference in the development cost of the two approaches is the need to recode to use the faster processor.

Cheers, Alf

Reply to
Alf Katz

Thanks, all, for your interesting replies. Unfortunately, no one seems to have collected metrics for the *effort* involved in translating assembler to C, which will be the main determinant of whether we proceed with this project. I have performed the task on a small (

Reply to
Alf Katz

"[..]

[..] One is building a better 8051 inside an FPGA with a 3:1 improvement in speed and heaps of hardware speed ups to critical tasks. [..] [..]"

8051s implemented in FPGAs are commercially available.

Reply to
Colin Paul Gloster

Yes, I deal with this situation all the time: the customer can't even tell you precisely what the product requirements are; only the legacy code can.

Reply to
steve

The core is (even the one Dallas bought), and I've found them to be a good starting point. They tend to be *too* 8051-compatible, without taking advantage of what the extra pins and other resources of the FPGA offer: stuff like a true Harvard architecture (separate P and X mem) with non-muxed address and data busses, single-write context switches, and the other hardware speed-ups alluded to. That's what makes the C conversion to a cheaper, faster MCU the riskier approach, and consequently why I'm trying to get a handle on the effort (man-hours/KSLOC) involved in the migration before committing to one approach or the other.

Cheers, Alf

Reply to
Alf Katz

Hold it a moment, please. Just so we're 101% perfectly clear about this: you've outgrown a 512 KiB super-8051 system writing all the code in *assembler*? I'll hand it to you, you must have had a team of brave people working on such a monster. I've maintained a 96 KiB code-size super-8051 project done entirely in assembly, and considered that to be sitting on the fence, leaning dangerously towards "mission impossible" territory.

I'd be careful with that expected speedup. Just ask yourself: if it were realistic to render that IP core Dallas bought at a 100 MHz clock frequency, what kept Dallas from doing that in their chips (keeping in mind they're going ASIC, so they should be faster, not slower, than an FPGA)? Would DalSemi really shoot itself in the foot just like that?

There is, theoretically at least, a third option, albeit a *thoroughly* nasty one: an 8051 machine-code interpreter, running on whatever CPU you can find that is fast enough to pull off that stunt.
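
Just to show the shape of the beast (a bare-bones sketch, decoding only two of the 256 opcodes; a real interpreter also needs the SFRs, the bit space, interrupts and cycle counting):

    #include <stdint.h>

    static uint8_t  code_mem[65536];  /* 8051 program memory image */
    static uint8_t  iram[256];        /* internal RAM */
    static uint16_t pc;
    static uint8_t  acc;

    static void step(void)            /* fetch-decode-execute one instruction */
    {
        uint8_t op = code_mem[pc++];
        switch (op) {
        case 0x74:                    /* MOV A,#imm */
            acc = code_mem[pc++];
            break;
        case 0xF5:                    /* MOV direct,A */
            iram[code_mem[pc++]] = acc;
            break;
        /* ... remaining opcodes, SFR decoding, interrupts, timing ... */
        default:
            break;                    /* unimplemented opcode: ignore in sketch */
        }
    }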
--
Hans-Bernhard Broeker (broeker@physik.rwth-aachen.de)
Even if all the snow were burnt, ashes would remain.
Reply to
Hans-Bernhard Broeker

For another data point, look at the new 'monster 80C51' devices with the Farcall C390 core: they claim 100 MIPS, with 512K Flash, and prices are apparently sub-$6.

I'd define the data sizes carefully; depending on the code/data split, you might already be above single-chip devices. There are very few 1-Mbyte ARM uCs, so this might push you into a microprocessor solution, which is a quite different animal.

It depends on the project, but another viable solution would be to split into two controllers: the stable low-level stuff stays in 80C51(s), and the new stuff goes into some 32-bit core. uCs these days are so cheap, they cost less than the packaging, or the cables, in many projects.

-jg

Reply to
Jim Granville
