INDEX
Explanations
references to the name "Kate" in the text
mentions of the name "Kate"
New Auto-Interp
Negative Logits
ATIONS
-0.89
atio
-0.87
indal
-0.76
ATIVE
-0.75
ations
-0.72
externalToEVAOnly
-0.70
iple
-0.70
ensical
-0.69
ENSE
-0.69
ATOR
-0.68
POSITIVE LOGITS
Upton
1.02
vich
0.93
Wins
0.89
lyn
0.87
patrick
0.82
Moss
0.82
Mara
0.79
hi
0.78
rina
0.78
vic
0.78
Activations Density 0.038%