INDEX
Explanations
phrases related to legal and political issues
New Auto-Interp
Negative Logits
sburg
-0.67
fair
-0.67
sov
-0.66
NN
-0.66
sat
-0.64
BF
-0.63
BP
-0.63
hops
-0.61
blast
-0.60
pad
-0.60
POSITIVE LOGITS
regards
1.65
stood
1.64
regard
1.62
draw
1.54
drawn
1.44
impunity
1.40
respect
1.30
standing
1.25
holding
1.20
utmost
0.94
Activations Density 1.392%