INDEX
Explanations
individuals associated with specific roles or accomplishments in their fields
New Auto-Interp
Negative Logits
oman
-0.16
ADO
-0.14
zd
-0.14
re
-0.13
ilage
-0.13
каб
-0.13
hood
-0.13
Bea
-0.13
activ
-0.13
avÄĽ
-0.13
POSITIVE LOGITS
Weinstein
0.16
à¥Įद
0.16
umu
0.15
avl
0.15
ojÃŃ
0.15
ãĥ³ãĥij
0.15
iyel
0.15
meiden
0.14
oklyn
0.14
иÑĤи
0.14
Activations Density 0.532%