INDEX
Explanations
expressions of disappointment or dissatisfaction related to new developments or changes
Negative Logits
رÛĮÙĩ     -0.16
bump     -0.16
spit     -0.15
hang     -0.15
øj       -0.15
quil     -0.15
strup    -0.14
veau     -0.14
.sy      -0.14
add      -0.14
Positive Logits
aside        0.27
aside        0.18
Aside        0.18
uptools      0.18
=set         0.18
parameters   0.18
:set         0.17
embro        0.17
tle          0.17
sail         0.17
Activations Density 0.065%