INDEX
Explanations
the name "Karen" and its variations in different contexts
Karen and Kare variants
New Auto-Interp
Negative Logits
Fü
-0.34
zhi
-0.33
tp
-0.33
zio
-0.32
submit
-0.31
PRIMARY
-0.31
オ
-0.31
Muhamma
-0.31
扁
-0.30
пра
-0.30
POSITIVE LOGITS
Karen
2.22
Karen
2.09
karen
1.62
karen
1.59
Kare
1.13
Kare
1.05
kare
0.92
kare
0.90
Karin
0.86
Karin
0.81
Activations Density 0.003%