INDEX
Explanations
phrases related to government accountability or weakness
New Auto-Interp
Negative Logits
ÙĴت
-0.15
porto
-0.15
ж
-0.15
inerary
-0.15
_blob
-0.15
istica
-0.15
ApplicationContext
-0.14
erate
-0.14
üz
-0.14
Westbrook
-0.14
POSITIVE LOGITS
odds
0.35
logger
0.25
Odds
0.25
witter
0.24
pains
0.23
bay
0.22
peace
0.22
liberty
0.22
fault
0.22
capacity
0.22
Activations Density 0.040%