INDEX
Explanations
references to historical figures, particularly Winston Churchill
mentions of historical or notable figures, particularly Winston Churchill
Negative Logits
Token       Logit
itate       -0.71
itated      -0.69
IRO         -0.67
alysis      -0.67
ities       -0.67
>>>>>>>>    -0.66
tarians     -0.66
nery        -0.66
uce         -0.65
cle         -0.64
Positive Logits
Token       Logit
Churchill   1.32
enburg      0.77
ufact       0.75
minster     0.74
Peters      0.73
ipeg        0.72
age         0.71
llor        0.68
fur         0.68
endor       0.67
Activations Density: 0.119%