INDEX
Explanations
verbs indicating completion or finalization of an action
New Auto-Interp
Negative Logits
ãĥ³ãĤ¸
-0.64
millenn
-0.60
²¾
-0.56
contingency
-0.55
alog
-0.55
lear
-0.55
paralle
-0.55
ocaly
-0.54
ibur
-0.54
nic
-0.53
POSITIVE LOGITS
.[
0.99
!.
0.87
.","
0.86
.;
0.84
.
0.82
.''.
0.81
.'
0.81
.):
0.80
.]
0.78
.</
0.77
Activations Density 0.368%