INDEX
Explanations
names of people prominent in various contexts
New Auto-Interp
Negative Logits
himself
-0.17
Woo
-0.15
uss
-0.14
Erf
-0.14
873
-0.14
son
-0.14
Lev
-0.14
Baxter
-0.14
ÑĥÑģ
-0.14
Levin
-0.14
POSITIVE LOGITS
herself
0.20
ová
0.16
]=="
0.15
jeme
0.15
lesbian
0.15
ovna
0.15
Lesbian
0.15
fone
0.15
ÙĬدة
0.15
},'
0.15
Activations Density 0.080%