INDEX
Explanations
verbs related to actions or decisions that can have an impact on others
actions related to interaction and emotional impact
New Auto-Interp
Negative Logits
)]
-0.61
vg
-0.61
CVE
-0.60
hawks
-0.58
union
-0.58
ilts
-0.58
utenberg
-0.57
Above
-0.56
eded
-0.56
foundland
-0.55
POSITIVE LOGITS
yourself
2.12
yourselves
1.93
your
1.51
Yourself
1.41
YOUR
1.26
your
1.10
Your
1.06
Your
1.06
oneself
0.95
yours
0.94
Activations Density 1.132%