Explanations
pronouns indicating personal perspective and involvement
Negative Logits
'},              -0.86
"){              -0.81
=")              -0.81
SequentialGroup  -0.79
]<<              -0.79
'){              -0.76
%")              -0.75
*/;              -0.75
AddTagHelper     -0.75
blest            -0.75
Positive Logits
,        0.70
us       0.66
.        0.66
moi      0.63
sendiri  0.62
him      0.61
me       0.61
nobis    0.58
us       0.56
myself   0.55
Activations Density 0.118%