INDEX
Explanations
phrases related to emotional struggles and difficult situations
New Auto-Interp
Negative Logits
eqn
-0.66
ⓧ
-0.65
surla
-0.63
adicionais
-0.61
femininos
-0.61
préférence
-0.58
normais
-0.58
חיצוניים
-0.57
Baillargeon
-0.57
cref
-0.56
POSITIVE LOGITS
rut
0.84
bind
0.72
position
0.69
dold
0.66
limbo
0.65
rut
0.64
hole
0.61
predicament
0.60
dilemma
0.58
Bind
0.58
Activations Density 0.213%