INDEX
Explanations
the pronoun "you" in the context of exploring perspectives or relationships
Negative Logits
dorf      -0.16
ec        -0.15
second    -0.15
еÑĢв      -0.14
immers    -0.14
ibs       -0.14
ä         -0.14
eyse      -0.14
Ģìŀ¥      -0.14
igmoid    -0.13
Positive Logits
astle     0.15
nef       0.15
anz       0.15
tridge    0.14
Rubin     0.14
ATK       0.13
è´µ       0.13
éĶĻ       0.13
iffies    0.13
-src      0.13
Activations Density 0.000%