INDEX
Explanations
statements or references to being or existing in a particular state or condition
New Auto-Interp
Negative Logits
Hacker
-0.60
MAS
-0.60
Shall
-0.60
Sakuya
-0.59
BT
-0.59
ANN
-0.58
Pai
-0.57
Freem
-0.56
429
-0.56
outlaw
-0.56
POSITIVE LOGITS
weakest
0.79
inki
0.75
nown
0.73
ezvous
0.73
safest
0.73
nearest
0.72
jured
0.71
zel
0.70
lins
0.70
essel
0.69
Activations Density 0.054%