forked from DReichLab/AdmixTools
-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.QpWave
96 lines (70 loc) · 3.57 KB
/
README.QpWave
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
DOCUMENTATION OF QpWavs and qpAdm:
See also .examples/pdoc.pdf
qpWave:
qpWave requires that the input data is available in a Recilab format such as EIGENSTRAT.
To convert to the appropriate format, one can use CONVERTF. See
README.CONVERTF for documentation of programs for converting file formats.
Executable and source code:
------------------------------------------------------------------------------
For information about installing the program, see README.ADMIXTOOLS.
After installing the programs, the executable for qpWave should be located in the bin directory.
To run qpWave, type the following on a linux machine.
$DIR/bin/qpWave -p parfile >logfile
$DIR: Path to the bin directory.
logfile: Name of the logfile. The logfile contains the output of the run.
parfile: Name of parameter file
qpWave gives evidence of the number of admixture flows between the left and right populations,
and should usually be run as a precursor to qpAdm (see below).
DESCRIPTION OF EACH PARAMETER in parfile:
genotypename: input genotype file (in eigenstrat or packedancestrymap r format)
snpname: input snp file (in eigenstrat format)
indivname: input indiv file (in eigenstrat format)
popleft: left population list (1 per line)
popright: right population list (1 per line)
details: YES
allsnps: YES
## default NO
When set to YES, the maximum number of snps available for each f4 statistic are used. By default,
only the intersect of snps across all Left and Right populations are used.
chrom: Only use snps in the specified chromosome.
DESCRIPTION OF OUTPUT FILE:
The program will write all the output to stdout. The output file prints the parfile entered by the user,
number of snps and individuals, jackknife block size, number of blocks for jackknife and the results.
See example shown in-
examples/qpWave.log
qpAdm:
To run qpAdm, type the following on a linux machine.
$DIR/bin/qpAdm -p parfile >logfile
$DIR: Path to the bin directory.
logfile: Name of the logfile. The logfile contains the output of the run.
parfile: Name of parameter file
qpAdm takes a parameter file in exactly the same format as qpWave.
The first population of (popleft) is the target population and the
main purpose of the program is to provide admixture weights, writing the
target as a linear combination of the other populations of popleft.
details: YES/NO
is a new option (not available in qpWave)
.../examples/qpAdm.log has annotations ### manually added to explain output
CAVEATS
1) It is important to realize that the answers are invalid if there has
been post admixture gene-flow between left and right populations.
2) Weights may be negative. If significantly so, the admixture model is
probably wrong.
3) We recommend keeping popright small. If large the covariance matrix
of f4-statististics is likely to be poorly estimated. I then don't trust
the computed p-values although the admixture weights seem to usually be
reasonable.
4) *** NEW ***
The new version is more robust in boundary cases where
the mixing coefficients are not well determined. In
well-conditioned cases the difference from the old code should
be very small. There's a magic constant (diagplus) that gets added
along the diagonal of various matrices. To get the old answers with the new
program code:
diagplus: 0
*** new feature (for both qpAdm and qpWave)***
Input can now be an fstats file -- see README.qpfstats
For qpAdm new parameter numboot: see README.qpfstats
Nick Patterson
------------------------------------------------------------------------------