INDEX
Explanations
mentions of specific individuals in various contexts
mentions of specific individuals, particularly those related to the context of the narrative
Negative Logits
Arabia    -0.75
ERAL      -0.73
cap       -0.71
gered     -0.71
AME       -0.71
ership    -0.71
odiac     -0.70
psc       -0.70
ebus      -0.69
ghai      -0.68
Positive Logits
Browne    1.32
millenn   0.84
challeng  0.83
Byrne     0.78
livest    0.78
anas      0.76
Tiff      0.74
itton     0.71
elsius    0.71
squats    0.71
Activations Density 0.005%