INDEX
Explanations
verbs or phrases indicating causation or resulting effects
phrases that express causation or effects
New Auto-Interp
Negative Logits
thia
-0.82
ban
-0.66
rina
-0.63
CLOSE
-0.60
pac
-0.59
phrine
-0.59
scrimmage
-0.58
champ
-0.58
presided
-0.58
withd
-0.58
POSITIVE LOGITS
hift
1.14
sense
1.00
sure
0.99
landfall
0.84
Sense
0.81
headlines
0.81
matters
0.78
AMERICA
0.71
arnaev
0.70
URE
0.67
Activations Density 0.115%