INDEX
Explanations
expressions related to actions and decisions
New Auto-Interp
Negative Logits
oneself
-0.15
yourselves
-0.15
andes
-0.14
даÑĤ
-0.14
InThe
-0.14
PLL
-0.13
onth
-0.13
eil
-0.13
367
-0.13
(the
-0.13
POSITIVE LOGITS
his
0.44
seu
0.40
sua
0.40
her
0.38
seus
0.38
suas
0.35
their
0.35
seine
0.33
your
0.32
suo
0.32
Activations Density 1.000%