INDEX
Explanations
personal negative emotions and uncertainty
New Auto-Interp
Negative Logits
您的
1.72
вашего
1.67
вашей
1.58
你的
1.52
ваша
1.51
вашем
1.50
ваш
1.50
Ihrem
1.48
ваших
1.46
your
1.42
POSITIVE LOGITS
anxiety
1.45
sometimes
1.40
usually
1.32
myself
1.25
Usually
1.21
stupid
1.20
kinda
1.18
depressing
1.14
mostly
1.14
Sometimes
1.13
Activations Density 0.110%