INDEX
Explanations
adverbs expressing certainty or confidence
expressions of certainty or emphasis
New Auto-Interp
Negative Logits
insula
-0.90
issy
-0.78
ocene
-0.77
AME
-0.76
orie
-0.75
anwhile
-0.69
uese
-0.69
ricks
-0.69
artment
-0.68
arro
-0.67
POSITIVE LOGITS
deserved
0.74
irritated
0.70
satisfied
0.67
surely
0.67
tempted
0.67
footed
0.67
"$:/
0.67
è¦
0.66
annoyed
0.66
rejo
0.65
Activations Density 0.010%