INDEX
Explanations
phrases indicating possession or individuality
Negative Logits
itself      -0.43
Itself      -0.40
horen       -0.39
itself      -0.39
เอง         -0.37
herself     -0.35
speelt      -0.34
spreken     -0.34
sám         -0.33
hésite      -0.33
Positive Logits
initiative  0.56
kind        0.54
AsUp        0.52
pace        0.52
opinion     0.51
linkovi     0.51
&___        0.50
private     0.49
skin        0.48
special     0.48
Activations Density: 0.016%