INDEX
Explanations
phrases and statements indicating stress and chaotic situations
New Auto-Interp
Negative Logits
monds
-0.14
acus
-0.14
uder
-0.14
ndo
-0.13
ój
-0.13
entric
-0.13
eyin
-0.13
æ¿
-0.12
bic
-0.12
obic
-0.12
POSITIVE LOGITS
add
0.69
Add
0.64
add
0.57
Add
0.54
added
0.53
-add
0.52
adds
0.51
Added
0.51
.add
0.50
_add
0.49
Activations Density 0.312%