INDEX
Explanations
contractions where 'am' or 'I'm' is used
expressions of personal opinion or self-referential statements
Negative Logits
glers              -0.74
externalToEVAOnly  -0.68
eness              -0.68
Dynamics           -0.65
Gap                -0.64
士                 -0.63
aneers             -0.63
Prev               -0.62
illance            -0.62
aceutical          -0.61
Positive Logits
glad      1.32
guessing  1.30
sure      1.24
amazed    1.21
sorry     1.20
afraid    1.14
thankful  1.11
tempted   1.11
ashamed   1.09
grateful  1.08
Activations Density: 0.134%