INDEX
Explanations
phrases related to causes or reasons for certain events or conditions
phrases indicating causation
New Auto-Interp
Negative Logits
apest
-0.67
oos
-0.65
urai
-0.64
igmat
-0.61
appro
-0.57
Letter
-0.55
cott
-0.54
IFT
-0.53
Malt
-0.53
talk
-0.52
POSITIVE LOGITS
by
1.39
BY
1.08
by
1.07
By
0.93
principally
0.91
By
0.90
chiefly
0.88
partly
0.84
bys
0.81
solely
0.81
Activations Density 0.146%