INDEX
Explanations
first-person and third-person pronouns indicating personal experiences and perspectives
New Auto-Interp
Negative Logits
loff
-0.15
.RightToLeft
-0.15
akan
-0.14
erken
-0.14
ÏĢιÏĥ
-0.14
tement
-0.14
engo
-0.14
ltk
-0.14
ÑĢой
-0.14
YNAM
-0.13
POSITIVE LOGITS
alic
0.15
enter
0.15
877
0.15
allel
0.14
ABCDEFG
0.14
posix
0.14
@$_
0.14
ilton
0.13
ilde
0.13
Imported
0.13
Activations Density 0.107%