Does someone knows how to get the next followed structure in Pandas?
I have a Pandas data frame that looks like this
ID VAR1 VAR2 VAR3 VAR4 VAR5
1 TT TG .01 .09 .90
2 CC TC .20 .34 .56
3 GG AG .10 .20 .70
4 TC TC .17 .48 .35
5 AG AG .12 .46 .32
6 GG TG .15 .20 .65
7 TT TC .06 .24 .70
8 TT CT .11 .21 .68
9 GT GT .05 .10 .85
- If the first 'letter' in [VAR2] (e.g T) it is doubled in [VAR1] (e.g. TT) -> take value from VAR3 (.01).
- If the combination of the letters in [VAR2] (e.g. TC) it is the same as in [VAR1] (e.g. TC) -> take value from VAR4 (.48).
- If the second 'letter' in [VAR2] (e.g G) it is doubled in [VAR1] (e.g GG) -> take value from VAR5 (.70).
Based on this information I would like my output data frame to look as follow:
ID VAR1 VAR2 VAR6
1 TT TG .01
2 CC TC .56
3 GG AG .70
4 TC TC .48
5 AG AG .46
6 GG TG .65
7 TT TC .06
8 TT CT .68
9 GT GT .10
Hopefully someone can give me a hint!
Thanks,
Ale.
Aucun commentaire:
Enregistrer un commentaire