INDEX
Explanations
phrases with a strong emotional or emphatic tone
pronouns and demonstrative words
Negative Logits
lining      -0.70
Weston      -0.65
alties      -0.60
colle       -0.57
AFL         -0.56
Naples      -0.56
climates    -0.56
xp          -0.52
Lavrov      -0.52
stemming    -0.52
Positive Logits
estine      0.83
%"          0.81
cellence    0.80
.""         0.77
`.          0.70
"—          0.70
'"          0.70
ACTED       0.69
]"          0.69
nomine      0.69
Activations Density: 0.177%