INDEX
Explanations
phrases related to policy and social issues
references to economic and social issues
New Auto-Interp
Negative Logits
interstitial
-0.87
çͰ
-0.86
''.
-0.83
Legend
-0.74
ĸļ
-0.72
]."
-0.70
yssey
-0.69
SourceFile
-0.68
unfocusedRange
-0.66
Adult
-0.66
POSITIVE LOGITS
democratically
0.97
coerc
0.94
harms
0.92
democracies
0.86
unpopular
0.83
impover
0.81
evils
0.79
oppress
0.78
inconven
0.78
trillions
0.77
Activations Density 1.295%