INDEX
Explanations
references to the name "Katy."
references to a specific individual named Katy
New Auto-Interp
Negative Logits
exile
-0.83
moss
-0.77
Rhino
-0.77
cius
-0.76
guards
-0.75
Rot
-0.73
guard
-0.70
ROM
-0.70
olin
-0.69
otine
-0.69
POSITIVE LOGITS
Katy
3.78
Katie
2.31
Kat
2.16
kat
1.47
Katrina
1.25
Katz
1.21
Rih
1.16
Kara
1.16
KP
1.16
Beyon
1.12
Activations Density 0.033%