Explanations
mentions of places or geographic locations
phrases related to public figures and their actions or consequences
Negative Logits
Token    Logit
]."      -1.10
?).      -1.06
.).      -1.05
)."      -1.02
!).      -1.00
}.       -0.96
).       -0.95
)).      -0.95
%).      -0.90
).[      -0.88
Positive Logits
Token          Logit
ecause         0.49
Churchill      0.48
Cardiff        0.47
':             0.47
Albion         0.46
resa           0.45
ousy           0.44
Gamble         0.44
Proposition    0.44
owered         0.44
Activations Density 4.037%