INDEX
Explanations
descriptions related to political figures or campaigns
sentiments and concepts related to love and unity
New Auto-Interp
Negative Logits
âĢ
-0.76
ÂŃ
-0.72
̶
-0.69
âĢ
-0.64
—
-0.63
ðŁ
-0.62
âĢķ
-0.62
ãĥ
-0.62
Ö¼
-0.61
ÂŃ
-0.61
POSITIVE LOGITS
Others
0.77
others
0.71
other
0.71
Others
0.68
OTHER
0.66
pts
0.65
Other
0.60
oppos
0.60
conflicting
0.58
Same
0.56
Activations Density 1.364%