INDEX
Explanations
the occurrences of the name "Claire" and its variations
Negative Logits
seins
-0.15
극
-0.15
fitte
-0.15
.populate
-0.15
ư
-0.14
roker
-0.14
anh
-0.14
дап
-0.14
nowhere
-0.14
spir
-0.13
Positive Logits
mont
0.28
ty
0.23
voy
0.22
ment
0.20
Voy
0.19
Boo
0.18
ece
0.18
sville
0.17
issent
0.17
nces
0.16
Activations Density 0.005%