Explanations
mentions of organizations or campaigns
mentions of organizations, notable individuals, and institutions related to social issues and legal contexts
Negative Logits
Token      Logit
frame      -0.90
fram       -0.84
rome       -0.84
station    -0.82
wagen      -0.80
stay       -0.80
frames     -0.79
hab        -0.79
uten       -0.78
train      -0.77
Positive Logits
Token        Logit
NAACP        1.00
GOODMAN      0.84
ONSORED      0.82
Cosponsors   0.78
glim         0.78
ICS          0.77
ENTS         0.76
Legal        0.74
ACP          0.73
resents      0.73
Activations Density 0.026%
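
As a rough illustration of the density figure, the sketch below assumes that "Activations Density" means the fraction of token positions on which the feature's activation is above zero; the page does not state this definition. The function name `activation_density`, the threshold, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: density counts positions with activation > 0. The dashboard
    itself does not spell out its definition.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 token positions, 26 of which fire.
acts = [0.0] * 100_000
for i in range(26):
    acts[i * 3000] = 1.5  # arbitrary nonzero activation value

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.026%
```

Under that assumption, a density of 0.026% corresponds to the feature firing on about 26 of every 100,000 tokens.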