INDEX
Explanations
instances of character names and their corresponding actions or emotional states
New Auto-Interp
Negative Logits
lesen
-0.17
fort
-0.15
žil
-0.15
طاÙĦ
-0.14
lexport
-0.14
avigate
-0.14
aÅŁ
-0.14
loor
-0.14
šti
-0.14
Äı
-0.14
POSITIVE LOGITS
stå
0.16
seins
0.15
holm
0.14
naï
0.14
233
0.14
IGH
0.13
ricks
0.13
absence
0.13
arry
0.13
rick
0.13
Activations Density 0.004%