INDEX
Explanations
passive voice constructions
expressions of disbelief or negation
New Auto-Interp
Negative Logits
ĪĴ
-0.75
Revenge
-0.67
Cance
-0.67
stre
-0.63
Showdown
-0.62
Cancel
-0.61
Reloaded
-0.61
Reborn
-0.60
Fury
-0.60
Wrong
-0.60
POSITIVE LOGITS
bother
1.36
realize
1.33
realise
1.26
bothered
1.10
mention
1.03
acknowledge
1.02
admit
1.01
noticed
1.00
notice
0.99
comprehend
0.98
Activations Density 0.290%