INDEX
Explanations
phrases and expressions related to requests and asking for permission
New Auto-Interp
Negative Logits
oul
-0.15
Malk
-0.14
gro
-0.14
ÙĨÙģ
-0.14
lush
-0.14
esses
-0.14
zig
-0.14
溶
-0.14
yy
-0.14
kle
-0.14
POSITIVE LOGITS
illac
0.16
wer
0.15
اÙĦÙĤÙĬ
0.15
hausen
0.15
YM
0.15
ervo
0.15
Press
0.15
esar
0.14
GRAM
0.14
ì²
0.14
Activations Density 0.296%