BIOL4585

Part 1: revisiting coding homework, variables and loops.

See the files codeV1.txt, codeV2.txt, and codeV3.txt. As a group, write down:

For homework, you will learn about running scripts. As a teaser, you can run codeV1 and codeV2 as scripts right now by entering one of the following commands:

The word ‘bash’ tells the computer to execute the file as a script, executing it with the bash language.

We will discuss as a class prior to moving on to part 2.

Part 2: Superhero Genome-Wide Association Study (GWAS) activity

This activity will be conducted in class–wait for further instruction.

Part 3: Superhero GWAS with data

Genetic data has been collected for 1000 individuals, for four different snps. Investigate the file with the name superpowerGWAS.txt. We will use this data to determine if all, none, or some subset of four genotyped alleles have an association with the superpowered phenotype.

We’ll start looking only at the first snp. In order to actually calculate a statistic for genetic association, we need to find four separate counts:

Each member of your table should individually collect one of the above counts. The file is too large and unsorted to actually count by eye, so you’ll need to use commands to count the number of individuals that fit each of the criteria.

Next, we want to find the allele counts for the four groups at the three remaining snps. That sounds like a lot of work. I can sympathize that writing out four separate commands just for one SNP can be tedious. Let’s be lazy programmers. To make things easier, we’ll write a loop with variables to do calculate all of the counts. As a group,

Homework