INDEX
Explanations
mentions of sides or perspectives, often related to politics or social issues
negative political affiliations or sentiments
New Auto-Interp
Negative Logits
ĸļ
-0.85
layers
-0.74
partName
-0.72
Wonderland
-0.71
ILCS
-0.70
SHARES
-0.70
..........
-0.69
Nig
-0.69
Ago
-0.68
beetles
-0.68
POSITIVE LOGITS
establishment
1.22
social
1.21
government
1.15
development
1.04
choice
0.99
capitalist
0.99
cycl
0.97
competitive
0.97
claimed
0.97
interest
0.96
Activations Density 0.023%