INDEX
Explanations
out of hand or out of control
New Auto-Interp
Negative Logits
igua
-0.80
операций
-0.77
カム
-0.73
shuffling
-0.73
ímos
-0.72
キック
-0.72
ös
-0.71
icherheit
-0.71
ikos
-0.71
thinkers
-0.70
POSITIVE LOGITS
out
2.48
control
1.73
outta
1.67
Control
1.55
control
1.50
uncontrollable
1.48
CONTROL
1.45
Out
1.45
Control
1.41
spir
1.38
Activations Density 0.014%