INDEX
Explanations
descriptions of vulnerability and emotional states
Negative Logits
Monfieur    -0.93
himſelf     -0.92
myſelf      -0.91
chofe       -0.89
pexpr       -0.87
Houſe       -0.86
Eſ          -0.84
ſtand       -0.84
ſever       -0.84
Efq         -0.83
Positive Logits
رشف         0.46
ra          0.43
summers     0.40
im          0.40
rad         0.40
tenberg     0.40
بگو         0.40
Im          0.39
Perhaps     0.38
x           0.37
Activations Density 0.313%