Assignment Goals
Solve a fundamental problem in computational biology.
Learn about strings in C.
Learn about the analysis of algorithms.
Checking Your Work and Hints
Various test protein and genetic input files are located at
/u/cs126/files/sequence/
.
You may use our reference solution
sequence126
to test your solution.
Here's a link to some
hints
if you need a place to get started.
Submission and readme
Submit the following files:
readme.txt sequence.c
The
readme.txt
file should contain the following information. Here is a
template readme file
.
Name, precept number, high level description of code, any problems encountered, and whatever help (if any) your received.
A table of running times (in seconds) for the input files listed.
An estimate of the running time and memory consumption as a function of N, where N is the number of characters in the input file.
Your submitted program should handle strings of length up to 10,000. It's fine to declare a global 10,001 x 10,001 array and use only the part of it that you need. Caution: if you are running on a Windows system and get an error message like
The system cannot execute the specified program.
then use a smaller value, say 9001 x 9001, when testing, but your submitted code should still use 10,001 x 10,001.
Enrichment Links
The genetic data are taken from the
National Center For Biotechnology Information
. This site also contains many examples of such database and alignment software.
Kevin Wayne