INDEX
Explanations
text indicating instructions or suggestions
New Auto-Interp
Negative Logits
ELD
-0.76
Cre
-0.58
MpServer
-0.58
Fra
-0.57
Fundamental
-0.57
Pos
-0.53
wheelchair
-0.52
hurts
-0.52
harmed
-0.51
Geh
-0.51
POSITIVE LOGITS
consider
0.81
beware
0.80
consult
0.79
subscribe
0.76
avoid
0.76
caution
0.75
heed
0.75
hesitate
0.75
check
0.74
omit
0.72
Activations Density 15.248%