From this video (21'15''), the speaker gives the following formulae in order to calculate the between and among populations genetic variance from the FST:
VAmongPop=2FSTVG
VWithinPop=(1−FST)VG
, where VG is the total genetic variance of this population if it was well mixing. VAmongPop is the variance among populations and VWithinPop is the variance within populations.
In this same video, the speaker defines FST as:
FST=Var(p)ˉp(1−ˉp)
, where p is a vector of frequencies of a given allele and ˉp and Var(p) are the mean and variance of this vector.
For example, consider a metapopulation made of 4 subpopulations. The allele frequencies in these 4 subpopulations are p=[0.2, 0.5, 0.8, 0.3]. ˉp is the mean of p (ˉp=0.45) and Var(p) is the variance of p (Var(p)=0.07 ).
Can you help me to make sense of the formulas for VAmongPop and VWithinPop?
Can you prove these formulas? I would expect that VAmongPop+VWithinPop=VG but it doesn't! Maybe the issue has to do with the fact that VWithinPop as defined above fit to haploid population and not diploid populations. Then, I would expect that for diploid populations VWithinPop becomes (1−2FST)VG. Is that correct? Just in order to make it more general, how do you extrapolate these definitions for a tetraploid population? Also maybe I misunderstand the meaning of VG. Is the variance within population the sum of the within populations variance or is it the average (or something else)? Thanks for your help!
No comments:
Post a Comment