INDEX
Explanations
mentions of government entities
references to government entities and their actions
New Auto-Interp
Negative Logits
ranging
-0.79
ãĥ¼ãĥĨ
-0.67
actionDate
-0.67
Angle
-0.65
Frequency
-0.64
colo
-0.64
elia
-0.63
sbm
-0.62
Chart
-0.62
Dialog
-0.62
POSITIVE LOGITS
intervened
0.88
Accountability
0.85
tasked
0.83
opted
0.82
sanctioned
0.81
recognized
0.80
slapped
0.75
bureaucracy
0.73
reacted
0.72
instituted
0.71
Activations Density 0.097%