INDEX
Explanations
proper nouns associated with individuals and organizations
New Auto-Interp
Negative Logits
ynos
-0.16
:\"
-0.16
ÑĤоÑīо
-0.15
.spatial
-0.15
ANJI
-0.14
itten
-0.13
ryn
-0.13
ighton
-0.13
ï¼Į以åıĬ
-0.13
اÙĪØ±
-0.13
POSITIVE LOGITS
vice
0.18
executive
0.17
vice
0.16
chief
0.16
director
0.15
,the
0.15
former
0.15
president
0.15
Executive
0.15
Executive
0.15
Activations Density 0.079%