INDEX
Explanations
references to the name "Katie" or similar variations
New Auto-Interp
Negative Logits
iro
-0.17
aws
-0.17
ts
-0.15
rett
-0.15
rams
-0.15
ç·Ĵ
-0.15
unu
-0.15
labs
-0.14
haven
-0.14
Popular
-0.14
POSITIVE LOGITS
Perry
0.16
ungan
0.16
did
0.15
егоднÑı
0.15
meni
0.14
COUR
0.14
ISTIC
0.14
Cour
0.14
McCabe
0.14
ToUpper
0.14
Activations Density 0.009%