INDEX
Explanations
reluctantly, against one's will, with anxiety
Negative Logits
grav        -0.12
modest      -0.10
div         -0.10
timid       -0.10
Bren        -0.09
iddy        -0.09
concern     -0.09
enery       -0.08
rejected    -0.08
vaguely     -0.08
Positive Logits
gr           0.38
reluct       0.28
reluctant    0.27
Rel          0.27
udging       0.23
reluctantly  0.22
reluctance   0.22
forced       0.20
forced       0.19
不得         0.19
Activations Density 0.140%