INDEX
Explanations
negative or critical sentiments
negations and expressions of inability or disappointment
New Auto-Interp
Negative Logits
Allies
-0.65
reinstated
-0.64
assador
-0.64
applic
-0.60
liber
-0.59
significance
-0.59
embodiments
-0.57
pot
-0.56
ensible
-0.55
effectiveness
-0.55
POSITIVE LOGITS
disappoint
1.25
hesitate
1.13
doubt
1.04
forget
0.95
forgetting
0.93
shy
0.93
envy
0.89
miss
0.85
fail
0.84
disappointed
0.84
Activations Density 0.276%