INDEX
Explanations
the repetition and emphasis of the word "so."
New Auto-Interp
Negative Logits
prd
-0.18
uisse
-0.16
õi
-0.16
kaar
-0.15
rott
-0.15
aint
-0.15
pr
-0.15
craft
-0.15
umm
-0.15
alace
-0.15
POSITIVE LOGITS
-called
0.26
onest
0.20
ìį¨
0.20
oner
0.19
far
0.19
aping
0.18
ars
0.18
oth
0.18
iled
0.16
ester
0.16
Activations Density 0.069%