INDEX
Explanations
instances of the word "talking" and its variations, indicating a focus on dialogue or discussions
New Auto-Interp
Negative Logits
emale
-0.76
iverpool
-0.75
uilt
-0.73
feeding
-0.70
boa
-0.70
peria
-0.68
metic
-0.67
eele
-0.65
cffff
-0.64
proc
-0.63
POSITIVE LOGITS
Heads
0.87
louder
0.86
about
0.82
aloud
0.81
Points
0.78
loudly
0.78
heads
0.74
voices
0.72
Points
0.72
filibuster
0.71
Activations Density 0.020%