INDEX
Explanations
statements of existence or presence in a text
Negative Logits
ibold         -0.18
ilon          -0.16
stayed        -0.15
.async        -0.15
ÑĢап          -0.15
lld           -0.14
ogg           -0.14
alon          -0.14
staying       -0.14
ijkl          -0.14
Positive Logits
Progress       0.27
progress       0.26
Progress       0.23
_progress      0.21
progress       0.20
progressing    0.19
-progress      0.19
.Progress      0.18
progressed     0.18
progression    0.18
Activations Density 0.007%
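
For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed: the fraction of token positions at which the feature's activation is above a threshold. This is an assumption about the metric, not a description of how this dashboard derives it. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen only so the result matches the 0.007% shown above.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The source does not define the metric.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 7 of 100,000 token positions.
acts = [0.0] * 100_000
for i in range(7):
    acts[i * 1000] = 1.5  # arbitrary non-zero activation value

density = activation_density(acts)
print(f"{density:.3%}")
```

A density this low would mean the feature is active on roughly 7 in every 100,000 tokens, which is typical of a sparse, narrowly scoped feature.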