INDEX
Explanations
specific mentions of the name "Kate" in the text
occurrences of the name "Kate."
New Auto-Interp
Negative Logits
ATIONS
-0.83
oresc
-0.81
ensed
-0.78
cffff
-0.78
ribution
-0.76
notation
-0.75
tremend
-0.75
plom
-0.72
orescence
-0.72
ensical
-0.71
POSITIVE LOGITS
Upton
1.10
Wins
0.94
McCann
0.94
Stein
0.87
Turner
0.85
Mara
0.84
Kate
0.84
rette
0.80
Tempest
0.80
Kane
0.80
Activations Density 0.007%