INDEX
Explanations
pharmaceutical amphetamines
New Auto-Interp
Negative Logits
W
0.65
B
0.62
J
0.62
T
0.60
R
0.60
Erfol
0.58
L
0.57
K
0.57
nennen
0.57
annoncé
0.56
POSITIVE LOGITS
a
0.73
্ড
0.70
the
0.65
1
0.64
5
0.63
enie
0.61
ences
0.60
2
0.59
that
0.59
6
0.59
Activations Density 0.000%