INDEX
Explanations
organizations or groups associated with specific causes or ideologies, particularly those focused on politics or advocacy
references to organizations or groups advocating for specific political or social causes
Negative Logits
veins       -0.76
churn       -0.72
circles     -0.72
mson        -0.71
glances     -0.69
aggrav      -0.67
rumours     -0.66
gorge       -0.66
quarters    -0.65
accent      -0.64
Positive Logits
Respons           1.14
Equality          1.10
Responsibility    1.09
Better            1.04
Change            0.96
Choice            0.94
Eth               0.92
Values            0.88
Reprodu           0.88
Peace             0.87
Activations Density 0.051%