INDEX
Explanations
phrases indicating advice or recommendations for actions
Follows "should," "Please", or "to"
instructions or advice to follow
New Auto-Interp
Negative Logits
OGND
-0.52
Diretto
-0.49
PeEnEo
-0.49
yym
-0.48
jsonify
-0.47
agramm
-0.46
]++;
-0.46
遽
-0.46
Vidite
-0.46
Hj
-0.45
POSITIVE LOGITS
remember
1.10
consider
1.04
beware
0.90
remember
0.86
Consider
0.86
Remember
0.85
familiarize
0.82
check
0.81
Remember
0.81
avoid
0.80
Activations Density 0.282%