INDEX
Explanations
names of a specific individual, potentially related to news or events
the name "Myers" and related contexts about a specific individual or case
Negative Logits
ives      -0.71
ī         -0.66
Canary    -0.65
illard    -0.65
Sax       -0.65
bral      -0.65
Chrom     -0.63
loads     -0.63
inates    -0.62
bow       -0.62
Positive Logits
mberg     0.89
cffff     0.78
auga      0.74
hler      0.74
chell     0.73
borough   0.73
pring     0.72
Briggs    0.72
ister     0.70
chool     0.70
Activations Density: 0.028%