INDEX
Explanations
occurrences of the phrase "for you" and its variations
New Auto-Interp
Negative Logits
simp
-0.17
ilar
-0.16
еÑĢп
-0.15
atters
-0.15
bles
-0.14
exe
-0.14
æ®
-0.14
ibly
-0.14
ihil
-0.14
оÑģÑĤан
-0.14
POSITIVE LOGITS
nl
0.16
agate
0.15
orang
0.15
zdy
0.15
iendo
0.15
gone
0.14
Bail
0.14
ÑĤик
0.14
Paladin
0.14
ARED
0.14
Activations Density 0.033%