INDEX
Explanations
mixed statements with opinions and observations
references to emotional or mental states of individuals
New Auto-Interp
Negative Logits
otine
-0.62
spokeswoman
-0.57
%%
-0.55
ackets
-0.53
sonian
-0.51
jurisd
-0.51
arest
-0.51
stadt
-0.50
-0.50
roups
-0.49
POSITIVE LOGITS
he
1.95
He
1.86
His
1.85
his
1.80
his
1.79
He
1.67
His
1.58
himself
1.46
he
1.42
HIS
1.30
Activations Density 0.909%