INDEX
Explanations
imperative phrases emphasizing the importance of taking action or making decisions
New Auto-Interp
Negative Logits
anas
-0.18
illa
-0.17
ILLA
-0.16
hl
-0.15
kh
-0.14
zar
-0.14
alles
-0.14
idd
-0.13
addtogroup
-0.13
laÄį
-0.13
POSITIVE LOGITS
iren
0.15
à¸ŀà¸Ń
0.15
inator
0.14
Horny
0.14
andel
0.14
note
0.14
ÙĮ
0.14
erdale
0.14
ëŁ½
0.14
anden
0.13
Activations Density 0.028%