INDEX
Explanations
names of people, particularly those involved in journalism or reporting
New Auto-Interp
Negative Logits
ynos
-0.17
oints
-0.15
stad
-0.14
มà¸Ļà¸ķร
-0.14
Uploaded
-0.14
aju
-0.14
ÏĥÏĦε
-0.14
indo
-0.14
akers
-0.14
irim
-0.13
POSITIVE LOGITS
æĺ¯
0.19
bio
0.19
adalah
0.18
is
0.18
æĺ¯
0.18
æĺ¯ä¸Ģ
0.17
bio
0.16
.bio
0.16
lives
0.16
BIO
0.15
Activations Density 0.032%