INDEX
Explanations
instances of the word "kiss" and its variations
New Auto-Interp
Negative Logits
lund
-0.15
632
-0.15
series
-0.15
orton
-0.14
atz
-0.14
llib
-0.14
ttp
-0.14
acock
-0.14
imal
-0.14
ãĥ¼ãĥ©
-0.14
POSITIVE LOGITS
gro
0.18
burgh
0.16
ulumi
0.16
baiser
0.15
anzi
0.15
ylland
0.15
seins
0.14
ÙĪØµ
0.14
bersome
0.14
_DECL
0.14
Activations Density 0.010%