R Statistical Analysis - bio-informatics

Closed - This job posting has been filled.

Job Description

I am looking for a statistician and/or R programmer, preferably with a background in bio-informatics or experience using R for genetic research, to give me written directions on how to use specific programs in R (either HAPMIX or StepPCO).

I am doing personal genetic research. As a non-statistician, I am able to use a program based on ADMIXTURE (it calculates percentages of ancestry, e.g., 23% Indian, 50% Irish, 27% German).

But here is the job: I also want to use either HAPMIX or StepPCO to calculate -when- the changes in ancestry occurred. I cannot understand how to use them from the README file alone. I need someone who can give me detailed instructions and answer questions.

General:

Give me precise written directions on how to use either HAPMIX or StepPCO to calculate approximately how many generations ago admixture events occurred, using a person's DNA information (about 1 million SNPs, represented by numbers).

Specifically:

a) show me how to look at a specific time when the DNA changed (i.e., indicating a marriage to a different ethnic group)

b) generate a list of 100 generations (about 3000 years back) with all the corresponding numbers

Extra - this is for those who have a bio-informatics background.

c) show me how to interpret the numbers - I will pay more for someone who can explain the significance of the numbers for ancestry purposes.

d) Find a way to use ADMIXTURE and HAPMIX/StepPCO together so I can interpret the data as percentages of ancestry instead of just DNA. For example, show how to determine specifically that an ancestor married a German (say) between 22 and 26 generations ago, not simply that the DNA changed.
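
To illustrate the shape of the list requested in b), here is a minimal Python sketch. It only maps generation numbers to approximate years back and does not run HAPMIX or StepPCO. The figure of 30 years per generation is an assumption taken from "100 generations (about 3000 years back)", and the function name is hypothetical.

```python
# Assumption: 30 years per generation, implied by
# "100 generations (about 3000 years back)" in item b).
YEARS_PER_GENERATION = 30


def generation_table(n_generations=100, years_per_generation=YEARS_PER_GENERATION):
    """Hypothetical helper: return (generation, approx_years_ago) rows."""
    return [(g, g * years_per_generation) for g in range(1, n_generations + 1)]


table = generation_table()

# Print the first few rows as tab-separated "generation<TAB>years ago".
for generation, years_ago in table[:3]:
    print(f"{generation}\t{years_ago}")
```

The per-generation numbers produced by HAPMIX or StepPCO would be added as extra columns next to each row; what those columns contain depends on the program's output and is not assumed here.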