INDEX
Explanations
contractions of the verb 'are'
New Auto-Interp
Negative Logits
ESE
-0.77
Reduce
-0.72
DS
-0.67
mater
-0.67
TAIN
-0.67
andise
-0.65
membr
-0.64
ren
-0.60
iates
-0.60
iox
-0.59
POSITIVE LOGITS
gonna
1.50
gotta
1.10
going
1.06
hoping
1.04
supposed
1.00
afraid
0.97
glad
0.96
guessing
0.96
not
0.94
sorry
0.93
Activations Density 0.064%