INDEX
Explanations
terms related to political positions or roles, specifically "minister."
references to government officials and their titles
New Auto-Interp
Negative Logits
esville
-0.74
Braves
-0.67
Crew
-0.66
oven
-0.64
user
-0.63
Crow
-0.63
Crew
-0.63
Model
-0.63
Century
-0.62
irc
-0.61
POSITIVE LOGITS
ariat
0.75
Lavrov
0.75
envoy
0.75
spokesman
0.75
Marino
0.73
chair
0.73
adviser
0.72
gestures
0.72
ruary
0.72
advisor
0.72
Activations Density 0.077%