INDEX
Explanations
phrases related to legal and political engagements
New Auto-Interp
Negative Logits
etry
-0.75
bey
-0.66
Origin
-0.66
opers
-0.66
ological
-0.63
Printed
-0.61
issue
-0.60
pler
-0.59
patient
-0.58
pled
-0.58
POSITIVE LOGITS
ments
0.90
TAIN
0.87
engaged
0.85
lished
0.84
ienced
0.82
engagement
0.78
aeper
0.73
reement
0.73
EMENT
0.72
encer
0.72
Activations Density 10.332%