INDEX
Explanations
political keywords related to specific individuals or groups
references to political figures and entities associated with ideological divides
New Auto-Interp
Negative Logits
oscope
-0.75
Adren
-0.71
phies
-0.71
pak
-0.70
redits
-0.66
aps
-0.65
effective
-0.65
osc
-0.62
pipe
-0.62
aft
-0.61
POSITIVE LOGITS
istani
0.81
anmar
0.77
hybrids
0.72
unity
0.72
heid
0.71
sentiment
0.71
relations
0.70
iosity
0.70
sentiments
0.70
hierarch
0.69
Activations Density 0.102%