INDEX
Explanations
terms related to emotional awareness and expression
New Auto-Interp
Negative Logits
innen
-0.17
asant
-0.16
ض
-0.16
inition
-0.15
Cumhuriyeti
-0.15
ourg
-0.15
werk
-0.14
æŀ¶
-0.14
estyle
-0.14
otas
-0.14
POSITIVE LOGITS
/em
0.20
ÑĨионалÑĮ
0.18
ãģ¾ãģ¾
0.18
charged
0.18
nel
0.16
emotional
0.16
ized
0.16
eel
0.16
roller
0.16
attachment
0.16
Activations Density 0.024%