INDEX
Explanations
words related to permission or authorization
the phrase "wouldn't" and its variations
New Auto-Interp
Negative Logits
ocal
-0.68
ARM
-0.67
æ³
-0.65
core
-0.65
story
-0.63
Case
-0.63
Dialog
-0.63
Gong
-0.61
agency
-0.61
dress
-0.60
POSITIVE LOGITS
't
1.19
never
0.84
proble
0.84
surely
0.82
terness
0.77
ÃĥÃĤ
0.76
hardly
0.76
tremend
0.76
adjourn
0.75
itiveness
0.75
Activations Density 0.011%