INDEX
Explanations
phrases indicating consent or agreement related to plans or actions
New Auto-Interp
Negative Logits
ضاء
-0.16
urry
-0.15
iff
-0.14
omor
-0.14
ressing
-0.14
Samar
-0.14
irst
-0.14
677
-0.13
cop
-0.13
lette
-0.13
POSITIVE LOGITS
ably
0.19
ä¿Ĺ
0.15
istrovstvÃŃ
0.15
athed
0.14
earing
0.14
baum
0.14
ILT
0.13
Kis
0.13
eyi
0.13
lá»Ŀi
0.13
Activations Density 0.021%