INDEX
Explanations
references to brevity and clarity in communication
New Auto-Interp
Negative Logits
rg
-0.15
Ãło
-0.15
575
-0.15
kia
-0.15
pip
-0.15
jak
-0.15
fer
-0.14
Fer
-0.14
pip
-0.14
rio
-0.14
POSITIVE LOGITS
WORDS
0.18
à¥įथन
0.16
words
0.16
ControlEvents
0.16
.glide
0.15
VOICE
0.15
voice
0.15
words
0.15
à¹Ĩ
0.14
Це
0.14
Activations Density 0.085%