Explanations
mentions of personal names, particularly "Katie", at varying activation strengths
mentions of specific individuals, particularly those named Katie and Julie
Negative Logits
Token          Logit
interstitial   -0.95
reed           -0.81
dfx            -0.81
lict           -0.76
enegger        -0.76
committee      -0.75
asks           -0.74
minist         -0.74
haps           -0.73
ribute         -0.72
Positive Logits
Token          Logit
Katie          1.00
Nolan          0.95
Kat            0.93
Upton          0.92
Cour           0.90
Sue            0.84
Burton         0.84
Fallon         0.84
Holmes         0.83
Anne           0.83
Activations Density: 0.006%
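
To put the density figure in concrete terms, here is a minimal Python sketch. The page does not define how density is computed, so this assumes it is the fraction of sampled token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and only illustrate the arithmetic.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which
    the feature fires at all. The source page does not state this.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    firing = sum(1 for a in activations if a > threshold)
    return firing / len(activations)


# Hypothetical sample: 6 firing positions out of 100,000 tokens.
sample = [0.0] * 99_994 + [0.4, 1.2, 0.7, 2.1, 0.3, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.006% corresponds to roughly 6 firing positions per 100,000 tokens.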