Princeton University
COS 217: Introduction to Programming Systems

Assignment 4: Assembly Language Programming and Testing

Purpose

The purpose of this assignment is to help you learn about computer architecture, assembly language programming, and testing strategies. It also will give you the opportunity to learn more about the GNU/Unix programming tools, especially bash, emacs, gcc, gdb, and gprof.

The assignment consists of two parts, each of which has subparts. We encourage you to complete Part 1 by the end of the first week of the assignment period.

Rules

Part 2f, as defined below, is the "extra challenge" part of this assignment. While doing the "extra challenge" part of the assignment, you are bound to observe the course policies regarding assignment conduct as given in the course Policies web page, plus one additional policy: you may not use any "human" sources of information. That is, you may not consult with the course's staff members, the lab teaching assistants, other current students via Piazza, or any other people while working on the "extra challenge" part of an assignment, except for clarification of requirements.

The extra challenge part is worth 11 percent of this assignment. So if you don't do any of the "extra challenge" part and all other parts of your assignment solution are perfect and submitted on time, then your grade for the assignment will be 89 percent.

Part 1: A Word Counting Program in Assembly Language

Part 1a: Translate to Assembly Language

The Unix operating system has a command named wc (word count). In its simplest form, wc reads characters from stdin until end-of-file, and writes to stdout a count of how many lines, words, and characters it has read. A word is a sequence of characters that is delimited by one or more white space characters.

Consider some examples. In the following, a space is shown as _s and a newline character as _n.

If the file named proverb contains these characters:

Learning_sis_sa_n
treasure_swhich_n
accompanies_sits_n
owner_severywhere._n
--_sChinese_sproverb_n

then the command:

$ wc < proverb

writes this line to stdout:

  5 12 82

If the file proverb2 contains these characters:

Learning_sis_sa_n
treasure_swhich_n
accompanies_sits_n
owner_severywhere._n
--_sssChinese_sproverb

(note that the last "line" does not end with a newline character) then the command:

$ wc < proverb2

writes this line to stdout:

  4 12 83

The FC010 /u/cos217/Assignment4 directory contains a file named mywc.c. That file contains a C program that implements the subset of the wc command described above. Translate that program into assembly language, thus creating a file named mywc.s. Note that the given mywc.c program uses global variables, and so your mywc.s must use global variables too. That is, your mywc.s program must store data in the DATA and/or BSS sections. Your assembly language program must behave exactly the same (i.e. must write exactly the same characters to stdout) as the given C program does.

Part 1b: Test

Design a test plan for your mywc program. Your test plan must include tests in three categories: (1) boundary testing, (2) statement testing, and (3) stress testing.

Create text files to test your programs. Name each such file such that its prefix is mywc and its suffix is .txt. The command ls mywc*.txt must display the names of all mywc test files, and only those files.

Describe your mywc test plan in your readme file. Your description must have this structure:

mywc boundary tests:

mywcXXX.txt: Description of the characteristics of that file, and how it tests boundary conditions of your mywc program.

mywcYYY.txt: Description of the characteristics of that file, and how it tests boundary conditions of your mywc program.

...
mywc statement tests:

mywcXXX.txt: Description of the characteristics of that file, and which statements of your mywc program it tests. Refer to the statements using the line numbers of the given mywc.c program.

mywcYYY.txt: Description of the characteristics of that file, and which statements of your mywc program it tests.

...

Your descriptions of the test files must be of the form "This file contains such-and-such characteristics, and so tests lines such-and-such of the program." Identify the lines of code tested by line numbers. The line numbers must refer to the given C code.
mywc stress tests:

mywcXXX.txt: Description of the characteristics of that file, and how it stress tests your mywc program.

mywcYYY.txt: Description of the characteristics of that file, and how it stress tests your mywc program.

...

In a more realistic context, your test files should contain non-ASCII character codes. However, in the context of this assignment, submit test files that contain character codes for only printable characters. Specifically, make sure your computer-generated test files contain only character codes (in hexadecimal) 09, 0A, and 20 through 7E. It would be difficult for your grader to examine files that contain other character codes.

You may submit as many test files as you want. However at most three of your test files may be large, and a large test file must contain no more than 50000 characters and no more than 1000 lines. It would be difficult for your grader to scroll through a test file that exceeds those limits.

The FC010 /u/cos217/Assignment4 directory contains bash shell scripts named testmywc and testmywcdiff that automate your testing. Comments at the beginning of those files describe how to use them. After copying the scripts to your project directory, you may need to execute the commands chmod 700 testmywc and chmod 700 testmywcdiff to give them "executable" permissions.

Part 2: Beat the Compiler

Background

Many programming environments contain modules to handle high-precision integer arithmetic. For example, the Java Development Kit (JDK) contains the BigDecimal and BigInteger classes.

The Fibonacci numbers are used often in computer science. See http://en.wikipedia.org/wiki/Fibonacci_numbers for some background information. Note in particular that Fibonacci numbers can be very large integers.

This part of the assignment asks you to use assembly language to compose a minimal but fast high-precision integer arithmetic module, and use it to compute large Fibonacci numbers.

Part 2a: Add `BigInt` Objects Using C Code

Suppose you must compute Fibonacci number 500000, that is, fib(500000)...

The /u/cos217/Assignment4 directory contains a C program that computes Fibonacci numbers. It consists of two modules: a client and a BigInt ADT.

The client consists of the file fib.c. The client accepts an integer x as a command-line argument, where x must be a non-negative integer. The client computes and writes fib(x) to stdout as a hexadecimal number. Then it writes to stderr the amount of CPU time consumed while performing the computation. Finally the client performs some boundary condition and stress tests, writing the results to stdout. The client module delegates most of the work to BigInt objects.

The BigInt ADT performs high precision integer arithmetic. It is a minimal ADT; essentially it implements only an "add" operation. The BigInt ADT consists of four files:

bigint.h is the interface. Note that the ADT makes eight functions available to clients: BigInt_new, BigInt_free, BigInt_assignFromHexString, BigInt_largest, BigInt_random, BigInt_writeHex, BigInt_writeHexAbbrev, and BigInt_add.
bigint.c contains implementations of the BigInt_new, BigInt_free, BigInt_assignFromHexString, BigInt_largest, BigInt_random, BigInt_writeHex, and BigInt_writeHexAbbrev functions.
bigintadd.c contains an implementation of the BigInt_add function.
bigintprivate.h is a private header file — private in the sense that clients never use it. It allows code to be shared between the two implementation files, bigint.c and bigintadd.c.

Study the given code. Then build a fib program consisting of the files fib.c, bigint.c, and bigintadd.c, with no optimizations (that is, without the -D NDEBUG option and without the -O option, as described below). Run the program to compute fib(500000). In your readme file note the amount of CPU time consumed.

Part 2b: Add `BigInt` Objects Using C Code Built with Compiler Optimization

Suppose you decide that the amount of CPU time consumed is unacceptably large. You decide to command the compiler to optimize the code that it produces...

Build the fib program using optimization. Specifically, build with the -D NDEBUG option so the preprocessor disables the assert macro, and the -O (that's an upper case "oh") option so the compiler generates optimized code. Run the resulting program to compute fib(500000). In your readme file note the amount of CPU time consumed.

Part 2c: Profile the Code

Suppose you decide that the amount of CPU time consumed still is too large. You decide to investigate by doing a gprof analysis to determine which functions are consuming the most time...

Perform a gprof analysis of the code from Part 2b. Save the textual report in a file named gprofreport. Don't delete the file; as described later in this document, you must submit that file.

Part 2d: Add `BigInt` Objects Using Assembly Language Code

Suppose, not surprisingly, your gprof analysis shows that most CPU time is spent executing the BigInt_add function. In an attempt to gain speed, you decide to code the BigInt_add function manually in assembly language...

Manually translate the C code in the bigintadd.c file into assembly language, thus creating the file bigintadd.s. Do not translate the code in other files into assembly language.

Your assembly language code must store all parameters and local variables defined in the BigInt_larger and BigInt_add functions in memory, on the stack.

Note that assert is a parameterized macro, not a function. (See Section 14.3 of the King book for a description of parameterized macros.) So assembly language code cannot call assert. When translating bigintadd.c to assembly language, simply pretend that the calls of assert are not in the C code.

Build a fib program consisting of the files fib.c, bigint.c, and bigintadd.s using the -D NDEBUG and -O options. Run the program to compute fib(x) for various values of x, and make sure it writes the same output to stdout as the program built from C code does. Finally, run the program to compute fib(500000). In your readme file note the amount of CPU time consumed.

The FC010 /u/cos217/Assignment4 directory contains a file named simple.c. That file defines a client of the BigInt module that is much simpler than the client defined by fib.c. If your bigintadd.s implementation is failing tests performed by the fib.c client, then you might find it helpful to debug using the simple.c client.

Part 2e: Add `BigInt` Objects Using Optimized Assembly Language Code

Suppose, to your horror, you discover that you have taken a step backward: the CPU time consumed by your assembly language code is approximately the same as that of the non-optimized compiler-generated code! So you decide to optimize your assembly language code...

Manually optimize your assembly language code in bigintadd.s, thus creating the file bigintaddopt.s. Specifically, perform these optimizations:

Store all parameters defined in the BigInt_larger and BigInt_add functions in caller-saved registers (where they are originally) instead of in memory. That is, use caller-saved registers throughout the definitions of the functions.
Store all local variables defined in the BigInt_larger and BigInt_add functions in callee-saved registers instead of in memory.

Build a fib program consisting of the files fib.c, bigint.c, and bigintaddopt.s using the -D NDEBUG and -O options. Run the program to compute fib(x) for various values of x, and make sure it writes the same output to stdout as the program built from C code does. Finally, run the program to compute fib(500000). In your readme file note the amount of CPU time consumed.

If your bigintaddopt.s implementation is failing tests performed by the fib.c client, then you might find it helpful to debug using the simple.c client.

Can you write assembly language code that is approximately as fast as the optimized code that the compiler generates? That is, can you approximately tie the compiler?

Part 2f: Add `BigInt` Objects Using Highly Optimized Assembly Language Code

Finally, suppose you decide to optimize your assembly language code even further, moving away from a statement-by-statement translation of C code into assembly language...

Further optimize your assembly language code in bigintaddopt.s, thus creating the file bigintaddoptopt.s. Specifically, perform these optimizations:

Use the assembly language loop patterns described in Section 3.6.7 of the Bryant & O'Hallaron book instead of the simpler but less efficient patterns described in precepts.
"Inline" the call of the BigInt_larger function. That is, eliminate the BigInt_larger function, placing its code within the BigInt_add function.
Use the adcq ("add with carry quad") instruction effectively. The adcq instruction computes the sum of its source operand, its destination operand, and the "carry" flag from the EFLAGS register, places the sum in the destination operand, and assigns 1 (or 0) to the "carry" flag if a carry occurred (or did not occur) during the addition. Effective use of the adcq instruction will use the "carry" flag in the EFLAGS register instead of a uiCarry variable to keep track of carries during addition.

Feel free to implement any additional optimizations you want. However, your BigInt_add function must be a general-purpose function for adding two given BigInt objects; the function cannot be specific to the task of adding two Fibonacci numbers to generate a third Fibonacci number. In other words, your function must work with the given fib.c client.

This part is difficult; we will not think unkindly of you if you decide not to do it. To do it properly you will need to learn about the adcq instruction, and about which instructions affect and do not affect the "carry" flag in the EFLAGS register.

Hint: When writing bigintaddoptopt.s, the problem is this: How can you preserve the value of the "carry" flag between executions of the adcq instruction? One solution is to save and then restore the value of the EFLAGS register. Another solution is to express the logic such that only instructions that do not affect the "carry" flag are executed between each execution of the adcq instruction.

Build a fib program consisting of the files fib.c, bigint.c, and bigintaddoptopt.s using the -D NDEBUG and -O options. Run the program to compute fib(x) for various values of x, and make sure it writes the same output to stdout as the program built from C code does. Finally, run the program to compute fib(500000). In your readme file note the amount of CPU time consumed.

If your bigintaddoptopt.s implementation is failing tests performed by the fib.c client, then you might find it helpful to debug using the simple.c client.

Can you beat the compiler?

Logistics

Develop on FC010. Use emacs to create source code. Use gdb to debug.

Do not use a C compiler to produce any of your assembly language code. Doing so would be considered academically dishonest. Instead produce your assembly language code manually.

We encourage you to develop "flattened" C code (as described in lectures and precepts) to bridge the gap between the given "normal" C code and your assembly language code. Using flattened C code as a bridge can eliminate logic errors from your assembly language code, leaving only the possibility of translation errors.

We also encourage you to use your flattened C code as comments in your assembly language code. Such comments can clarify your assembly language code substantially.

Create a readme file by copying the file /u/cos217/Assignment4/readme to your project directory, and editing the copy by replacing each area marked "<Complete this section.>" as appropriate.

One of the sections of the readme file requires you to list the authorized sources of information that you used to complete the assignment. Another section requires you to list the unauthorized sources of information that you used to complete the assignment. Your grader will not grade your submission unless you have completed those sections. To complete the "authorized sources" section of your readme file, copy the list of authorized sources given in the "Policies" web page to that section, and edit it as appropriate.

Submit your work electronically on FC010 using these commands:

submit 4 mywc.s
submit 4 mywc*.txt
submit 4 gprofreport
submit 4 bigintadd.s
submit 4 bigintaddopt.s
submit 4 bigintaddoptopt.s
submit 4 readme

Grading

Minimal requirement to receive credit for the "Translate to Assembly Language" (Part 1a) implementation:

The mywc.s implementation must build.

Minimal requirement to receive credit for the "Add BigInt Objects Using Assembly Language Code" (Part 2d) implementation:

The bigintadd.s implementation must build with fib.c and bigint.c using the -D NDEBUG and -O options.

Minimal requirement to receive credit for the "Add BigInt Objects Using Optimized Assembly Language Code" (Part 2e) implementation:

The bigintaddopt.s implementation must build with fib.c and bigint.c using the -D NDEBUG and -O options.

Minimal requirement to receive credit for the "Add BigInt Objects Using Highly Optimized Assembly Language Code" (Part 2f) implementation:

The bigintaddoptopt.s implementation must build with fib.c and bigint.c using the -D NDEBUG and -O options.

We will grade your code on quality from the user's and programmer's points of view. From the user's point of view, your code has quality if it behaves as it must. The correct behavior of your code is defined by the previous sections of this assignment specification.

From the programmer's point of view, your code has quality if it is well-styled and thereby easy to maintain. Comments in your assembly language code are especially important. Each assembly language function — especially the main function — must have a comment that describes what the function does. Local comments within your assembly language functions are equally important. Comments copied from corresponding flattened C code are particularly helpful.

Your assembly language code must use .equ directives to avoid "magic numbers." In particular, in bigintadd.s you must use .equ directives to give meaningful names to:

Enumerated constants, for example: .equ TRUE, 1.
Parameter stack offsets, for example: .equ OADDEND1, 48.
Local variable stack offsets, for example: .equ ULCARRY, 24.
Structure field offsets, for example: .equ LLENGTH, 0.

In bigintaddopt.s you must use .equ directives to give meaningful names to:

Enumerated constants, for example: .equ TRUE, 1.
Registers that store parameters, for example: .equ OADDEND1, %rdi.
Registers that store local variables, for example: .equ ULCARRY, %r15.
Structure field offsets, for example: .equ LLENGTH, 0.

To encourage good coding practices, we will deduct points if gcc217 generates warning messages.

Testing is a substantial aspect of the assignment. Approximately 10% of the grade will be based upon your mywc test plan as described in your readme file and as implemented by your data files.

Your grade for Part 2f will be based upon:

Raw performance (5 percent). That is, how quickly does your code compute fib(500000)? We can't award any raw performance points unless your code passes all tests in the fib.c client.
Style (6 percent). That is, does your code contain optimizations that logically should improve performance?

This assignment was written by Robert M. Dondero, Jr.,
based in part upon suggestions from Andrew Appel.

Princeton University COS 217: Introduction to Programming Systems