INDEX
Explanations
phrases related to making decisions or sharing opinions
the phrase "I'll" to indicate future intentions or actions
New Auto-Interp
Negative Logits
Hebdo
-0.74
Tactics
-0.68
Stability
-0.60
eele
-0.60
endez
-0.58
Result
-0.58
populations
-0.56
-+-+
-0.55
AVG
-0.55
Scores
-0.54
POSITIVE LOGITS
myself
0.80
sorry
0.74
telling
0.72
ope
0.71
guessing
0.71
na
0.71
agree
0.70
glad
0.70
grim
0.69
ishi
0.69
Activations Density 0.085%