Explanations
content related to government, opinions, and policy
phrases and terms related to governmental accountability and performance
Negative Logits
Cub            -0.57
sexes          -0.56
visualization  -0.55
Brit           -0.55
classmate      -0.55
teammate       -0.55
cubes          -0.55
occurrence     -0.54
Ov             -0.54
Semin          -0.54
Positive Logits
appoint        0.86
erity          0.81
legisl         0.78
appointing     0.76
veto           0.75
enact          0.74
arrog          0.73
prag           0.72
inaction       0.71
wisely         0.70
Activations Density: 1.168%