Explanations
expressions of refusal and determination in conversations
Negative Logits
lag         -0.18
více        -0.16
iso         -0.15
defgroup    -0.14
ico         -0.14
yn          -0.13
eless       -0.13
Tep         -0.13
Elevated    -0.13
Labels      -0.13
POSITIVE LOGITS
firm         0.25
fixed        0.22
fixed        0.20
固           0.19
flex         0.19
Firm         0.19
immutable    0.18
flexibility  0.17
Flex         0.17
firm         0.17
Activations Density 0.244%