INDEX
Explanations
phrases expressing commitment or ongoing involvement
New Auto-Interp
Negative Logits
itself
-0.23
æĪijçļĦ
-0.21
meiner
-0.17
saya
-0.17
ç»ĻæĪij
-0.17
minha
-0.16
meu
-0.16
meine
-0.15
my
-0.15
mijn
-0.15
POSITIVE LOGITS
ourselves
0.36
talking
0.26
fortunate
0.22
lucky
0.22
told
0.21
mere
0.20
hearing
0.20
seeing
0.20
living
0.19
fortunate
0.19
Activations Density 0.104%