Explanations
emotions and physical sensations
references to emotional experiences and quantitative observations
Negative Logits
colonial    -0.48
veto        -0.48
Tax         -0.47
lymph       -0.47
Penal       -0.47
天          -0.46
mone        -0.46
cartoon     -0.46
act         -0.45
Refugee     -0.45
Positive Logits
nonetheless  0.85
anyway       0.74
awaru        0.74
anyways      0.72
moreover     0.63
altogether   0.62
encompasses  0.61
proport      0.60
EEE          0.58
overwhel     0.58
Activations Density: 1.186%