5.12 confatherduos.dat

confatherduos.dat: This file contains a header line (which is not read in by the program but is useful for reminding yourself of the column order), followed by a line of data for each of the n SNPs to be analysed (IN EXACTLY THE SAME ORDER as given in emimmarkers.dat).

The first number on each line is the numeric SNP ID (as given in emimmarkers.dat). This is followed by 7 cell counts corresponding to the number of fully genotyped control/father duos whose genotype combinations fall into the appropriate genotype categories. Note that by controls we mean individuals of unknown disease status, or (provided the disease is rare) individuals who are known to be disease-free. This must not include individuals who have already appeared in any other input files (i.e. all input data files must be independent).

For example, suppose that at the first SNP the genotype combinations of the father and child are

group father child count
1 22 22 0
2 22 12 1
3 12 22 0
4 12 12 1
5 12 11 1
6 11 12 2
7 11 11 6

Then the line in confatherduos.dat corresponding to this SNP would look like:

1      0 1 0 1 1 2 6

An example of confatherduos.dat for 8 SNPs, of which the first has counts as given above, and the other seven just happen to all have exactly the same (different set of) genotype counts (admittedly a contrived example is shown below:

snp cellcount 1-7
1      0 1 0 1 1 2 6
2      0 0 0 0 1 2 8
3      0 0 0 0 3 1 7
4      0 0 0 0 5 3 3
5      0 0 0 3 1 0 7
6      0 0 0 0 1 0 10
7      0 0 0 0 2 4 5
8      0 1 0 0 0 0 10