INDEX
Explanations
phrases related to effort and responsibility in completing tasks
New Auto-Interp
Negative Logits
achi
-0.20
asher
-0.16
ACHI
-0.15
packageName
-0.15
Rao
-0.15
hiro
-0.15
ubbo
-0.14
алÑĸв
-0.14
olf
-0.14
.isSuccessful
-0.14
POSITIVE LOGITS
heavy
0.36
heavy
0.34
Heavy
0.31
dirty
0.30
dirty
0.30
leg
0.30
work
0.28
grunt
0.28
Heavy
0.27
grunt
0.27
Activations Density 0.075%