INDEX
Explanations
discussions surrounding political decisions and changes in government policy
New Auto-Interp
Negative Logits
Ø®ÛĮ
-0.14
illard
-0.14
çŃĴ
-0.14
رÙĪØ´
-0.13
unwrap
-0.13
urer
-0.13
ersistent
-0.13
ekler
-0.13
idunt
-0.13
bilt
-0.13
POSITIVE LOGITS
backtrack
0.38
reversed
0.34
retract
0.32
reversal
0.32
cave
0.32
reversing
0.31
relent
0.30
reconsider
0.30
capit
0.30
reverse
0.30
Activations Density 0.171%