INDEX
Explanations
phrases related to political leadership and unity
New Auto-Interp
Negative Logits
udos
-0.48
earchers
-0.47
idav
-0.46
Variant
-0.45
urai
-0.45
bnb
-0.44
spokesman
-0.43
à¨
-0.43
aez
-0.43
ische
-0.43
POSITIVE LOGITS
$.
0.64
%.
0.64
}.
0.58
'.
0.56
)).
0.55
.�
0.53
]).
0.53
]."
0.51
.''.
0.51
".
0.51
Activations Density 8.823%