INDEX
Explanations
expressions of reassurance and comfort
New Auto-Interp
Negative Logits
maji
-0.50
########.
-0.50
knap
-0.49
رشف
-0.49
sportback
-0.49
correctes
-0.48
Leider
-0.48
tph
-0.47
niyang
-0.47
zin
-0.47
POSITIVE LOGITS
worry
1.68
Worry
1.49
worries
1.44
fear
1.38
worry
1.32
fret
1.24
Fear
1.22
Fear
1.20
tenang
1.19
fear
1.18
Activations Density 0.137%