Explanations
legal and political terms, including dates and locations
legal and political terms related to decisions and actions
Negative Logits
Token    Logit
!)       -0.70
)|       -0.69
!),      -0.66
…)       -0.66
...)     -0.64
)"       -0.64
)--      -0.64
-)       -0.60
)'       -0.59
?)       -0.59
Positive Logits
Token            Logit
etheless         1.08
respectively     0.97
nonetheless      0.66
markedly         0.63
strikingly       0.62
nevertheless     0.61
remarkably       0.61
ogether          0.60
systematically   0.59
umbered          0.58
Activations Density: 1.068%