INDEX
Explanations
phrases related to political figures taking action or expressing plans
political intentions and actions related to lawmakers
Negative Logits
datas           -0.53
catentry        -0.52
optimization    -0.52
decomp          -0.52
âĶĢâĶĢ          -0.52
Pwr             -0.50
beginner        -0.49
larvae          -0.48
sample          -0.46
phys            -0.46
Positive Logits
Democrats       0.72
trump           0.71
clinton         0.67
Democrats       0.66
Putin           0.66
Republicans     0.66
conservatives   0.65
politic         0.64
Obama           0.64
partisan        0.64
Activations Density 5.610%