Explanations
phrases related to actions done in the name of a certain cause or authority
Negative Logits
Cosponsors    -0.80
igue          -0.79
oor           -0.77
iaries        -0.75
ards          -0.73
iste          -0.70
ask           -0.69
jad           -0.68
aves          -0.67
read          -0.67
Positive Logits
protecting    0.91
preserving    0.90
facilitating  0.87
combating     0.85
promoting     0.84
advancing     0.84
enhancing     0.83
aiding        0.81
improving     0.81
sheer         0.81
Activations Density 0.156%