INDEX
Explanations
responses in first-person narrative style
negations and expressions of limitations or inability
New Auto-Interp
Negative Logits
CLR
-0.73
Rosenstein
-0.66
Lavrov
-0.63
Hof
-0.61
Aerospace
-0.60
aic
-0.60
Highlands
-0.60
idates
-0.59
ATF
-0.59
Lange
-0.58
POSITIVE LOGITS
myself
0.90
ngth
0.69
praying
0.67
liking
0.67
personally
0.66
reckoning
0.66
venge
0.65
oan
0.64
writing
0.64
regret
0.62
Activations Density 0.855%