INDEX
Explanations
phrases related to political figures or locations, particularly "Downing Street."
references to specific locations and political figures
New Auto-Interp
Negative Logits
ropri
-0.73
rieg
-0.69
afia
-0.68
Gutenberg
-0.67
netflix
-0.66
ilk
-0.65
ENDED
-0.65
olina
-0.64
eport
-0.63
asters
-0.63
POSITIVE LOGITS
Downing
0.96
nuts
0.95
nut
0.87
stall
0.83
vous
0.82
tree
0.81
hurst
0.81
stein
0.80
Britann
0.68
berg
0.67
Activations Density 0.026%