INDEX
Explanations
action-oriented verbs and phrases indicating completion or achievement
New Auto-Interp
Negative Logits
��
-0.84
istani
-0.73
ガ
-0.72
�
-0.72
obe
-0.70
agger
-0.69
kamp
-0.68
orem
-0.68
likely
-0.67
stru
-0.67
POSITIVE LOGITS
oneself
0.81
countless
0.80
their
0.80
exhaustive
0.76
numerous
0.76
themselves
0.75
rave
0.73
endless
0.72
utmost
0.72
extensive
0.71
Activations Density 1.118%