INDEX
Explanations
phrases indicating statements or opinions
frequent use of personal pronouns and subjective expressions
Negative Logits
Token     Logit
.</       -0.71
ãĢĤ       -0.67
.}        -0.66
.(        -0.61
�         -0.60
////      -0.60
ãĢij       -0.60
.<        -0.58
âĢ¢âĢ¢    -0.58
-->       -0.58
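Several tokens in the negative-logit list (for example ãĢĤ and âĢ¢âĢ¢) look like encoding debris, but they match the printable stand-ins that a GPT-2-style byte-level BPE vocabulary uses for raw bytes. The sketch below assumes that tokenizer family, which the page itself does not state. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not taken from the page.

```python
def bytes_to_unicode():
    """Byte -> printable character table in the style of GPT-2's byte-level BPE.

    Printable bytes map to themselves; all other bytes are shifted to
    code points starting at 256 so every byte has a visible stand-in.
    """
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable stand-in character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Recover the text a byte-level token stands for (assumed encoding).

    Tokens containing characters outside the table (such as U+FFFD)
    are returned unchanged, since their original bytes are not recoverable.
    """
    if any(ch not in CHAR_TO_BYTE for ch in token):
        return token
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # Partial multi-byte sequences become U+FFFD instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĢĤ", "ãĢij", "âĢ¢âĢ¢", ".</"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ãĢĤ stands for the CJK full stop 。, ãĢij for the bracket 】, and âĢ¢âĢ¢ for two bullet characters, so the strongest negative logits are all punctuation or markup delimiters.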
Positive Logits
Token        Logit
zbollah      1.01
odore        0.90
ffield       0.89
xiety        0.87
vana         0.85
certainly    0.85
usterity     0.84
definitely   0.82
%"           0.82
pherd        0.82
Activations Density 0.254%
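The page does not define the density figure. A common reading, assumed here, is the fraction of sampled token positions on which the feature's activation is above zero. The function name `activation_density` and the sample data are hypothetical and serve only to show the arithmetic.

```python
from typing import Iterable


def activation_density(activations: Iterable[float], threshold: float = 0.0) -> float:
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which
    the feature fires at all; the source page does not spell this out.
    """
    total = 0
    active = 0
    for a in activations:
        total += 1
        if a > threshold:
            active += 1
    if total == 0:
        raise ValueError("no activations given")
    return active / total


if __name__ == "__main__":
    # Hypothetical sample: the feature fires on 254 of 100,000 token positions.
    sample = [1.0] * 254 + [0.0] * 99_746
    print(f"{activation_density(sample):.3%}")  # prints 0.254%
```

Read this way, a density of 0.254% means the feature is active on roughly 2.5 of every 1,000 tokens.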