INDEX
Explanations
mentions of political figures named "Muammar" with differing activations suggesting a focus on specific instances or elements related to the individual throughout the text
references to the name "Muammar."
New Auto-Interp
Negative Logits
代
-0.90
Peb
-0.71
OME
-0.69
EMS
-0.69
CAST
-0.67
Dispatch
-0.66
Dangerous
-0.65
peak
-0.65
pans
-0.64
arantine
-0.63
POSITIVE LOGITS
riages
1.07
quez
0.88
iously
0.87
ques
0.85
anth
0.81
zynski
0.79
iam
0.78
rying
0.76
quet
0.76
ital
0.75
Activations Density 0.009%