Explanations
references to personal possessions and relationships
Negative Logits
sumer      -0.17
rema       -0.16
mentioned  -0.15
nist       -0.15
omik       -0.15
Mention    -0.14
unhappy    -0.14
řed        -0.14
omi        -0.14
elsius     -0.14
Positive Logits
eger       0.16
ыш         0.15
athi       0.15
assi       0.15
erner      0.14
太郎       0.14
yp         0.14
ahlen      0.14
DF         0.14
azine      0.13
Activations Density 0.270%