INDEX
Explanations
phrases expressing conflict or challenges
NEGATIVE LOGITS
rosso       -0.18
rellas      -0.16
esub        -0.15
etas        -0.15
Otherwise   -0.14
andler      -0.14
uvo         -0.14
ager        -0.14
rosse       -0.13
immel       -0.13
POSITIVE LOGITS
depending   1.18
depending   1.04
Depending   0.74
depends     0.73
Depending   0.69
depend      0.67
depends     0.65
Depends     0.64
depended    0.62
dependent   0.58
Activations Density: 0.298%