Explanations
phrases indicating movement or placement into a space
Negative Logits
onte    -0.17
hed     -0.16
дол     -0.15
ç©´     -0.15
ữ       -0.14
[$_     -0.14
ìĬĪ     -0.14
.Åŀ     -0.14
梯      -0.13
ìĿ      -0.13
Positive Logits
677     0.17
Fab     0.15
iyan    0.15
Ung     0.15
fab     0.15
Nga     0.15
abbo    0.15
aman    0.15
670     0.14
whose   0.14
Activations Density 0.071%