R Statistical Analysis - bio-informatics
Closed - This job posting has been filled.
I am looking for a statistician and/or programmer in R, preferably with a background in bio-informatics or experience using R for genetic research, to give me written directions on how to use specific programs in R (either HAPMIX or StepPCO).
I am doing personal genetic research. As a non-statistician I am able to use a program based on ADMIXTURE (calculates percentage of ancestry, e.g., 23% Indian, 50% Irish, 27% German).
But here is the job: I also want to use either HAPMIX or StepPCO to calculate -when- the changes in ancestry occurred. I cannot understand how to use them from the README file alone. I need someone who can give me detailed instructions and answer questions.
Give me precise written directions on how to use either HAPMIX or StepPCO to calculate approximately how many generations ago admixture events occurred, using a person's DNA information (about 1 million SNPs, represented by numbers).
a) show me how to look at a specific time when the DNA changed (i.e., indicating a marriage to a different ethnic group)
b) generate a list of 100 generations (about 3000 years back) with all the corresponding numbers
Extra - this is for those who have a bio-informatics background.
c) show me how to interpret the numbers - I will pay more for someone who can explain the significance of the numbers for ancestry purposes.
d) Find a way to use ADMIXTURE and HAPMIX/StepPCO together so I can interpret the data as percentages of ancestry instead of just DNA. For example, how to determine specifically that an ancestor married (for example) a German between (for example) 22-26 generations ago--not simply that the DNA changed.
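To illustrate the layout I have in mind for item (b), here is a minimal sketch in Python rather than R. It only converts generation counts to approximate years, assuming 30 years per generation (the figure implied by "100 generations, about 3000 years"). It does not call HAPMIX, StepPCO, or ADMIXTURE and computes nothing from DNA; the function name `generation_table` is made up for this example.

```python
# Assumption: 30 years per generation, implied by
# "100 generations (about 3000 years back)" in the posting.
YEARS_PER_GENERATION = 30


def generation_table(n_generations=100, years_per_generation=YEARS_PER_GENERATION):
    """Return (generation, approximate years ago) pairs for generations 1..n."""
    return [(g, g * years_per_generation) for g in range(1, n_generations + 1)]


table = generation_table()

# Print the first few rows; the full table has one row per generation.
for generation, years_ago in table[:3]:
    print(f"{generation:>3} generations ago ~ {years_ago} years ago")
```

The per-generation numbers from HAPMIX or StepPCO would go in a further column next to each row; what those numbers are and how to read them is what I need explained.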