INDEX
Explanations
phrases related to feelings of powerlessness and emotional struggles
New Auto-Interp
Negative Logits
pedia
-0.15
obsc
-0.14
rips
-0.14
urar
-0.14
xed
-0.14
óg
-0.14
alls
-0.14
ÑĢÑĥн
-0.14
fq
-0.14
strup
-0.14
POSITIVE LOGITS
aim
0.32
direction
0.27
hope
0.25
clue
0.25
hope
0.24
fee
0.23
direction
0.23
help
0.23
aim
0.23
fruit
0.22
Activations Density 0.398%