INDEX
Explanations
phrases indicating audience engagement or involvement
New Auto-Interp
Negative Logits
ikal
-0.07
egl
-0.07
filer
-0.07
kes
-0.06
tb
-0.06
_nf
-0.06
fal
-0.06
yms
-0.06
pell
-0.06
aç
-0.06
POSITIVE LOGITS
eland
0.07
irth
0.06
ople
0.06
bakan
0.06
lamaz
0.06
**)&
0.06
ONUS
0.06
اراÙĨ
0.06
linger
0.06
pton
0.06
Activations Density 0.002%