INDEX
Explanations
proper nouns related to politics and government officials
prominent political figures and institutions
New Auto-Interp
Negative Logits
Translation
-0.64
Els
-0.60
Pengu
-0.57
ãĥ¼ãĥĨ
-0.57
âĶģ
-0.55
Redd
-0.55
bearing
-0.54
ãĥ¬
-0.53
farious
-0.53
âĸ¬âĸ¬
-0.51
POSITIVE LOGITS
has
1.19
insists
1.14
wants
1.12
undertook
1.11
intends
1.09
refuses
1.09
believes
1.08
denies
1.07
reacted
1.07
expects
1.07
Activations Density 0.763%