INDEX
Explanations
phrases related to political discourse and leadership challenges
New Auto-Interp
Negative Logits
еÑģÑĤи
-0.08
alis
-0.07
aru
-0.06
incare
-0.06
emale
-0.06
vid
-0.06
onta
-0.06
enou
-0.06
ehir
-0.06
beit
-0.06
POSITIVE LOGITS
whose
0.09
who
0.08
whose
0.08
someone
0.07
who
0.07
.Dispatcher
0.07
LLL
0.07
ibo
0.06
guy
0.06
ponsible
0.06
Activations Density 0.027%