INDEX
Explanations
instances of the word "well" used as an interjection or indicating agreement
the word "Well" with varying frequencies of activation
Negative Logits
Token         Logit
amas          -0.67
dash          -0.66
anca          -0.64
aca           -0.62
acters        -0.61
agne          -0.61
punishable    -0.59
identifying   -0.59
ibi           -0.58
replacing     -0.56
Positive Logits
Token         Logit
Well          3.38
Well          2.59
well          1.82
well          1.44
Hmm           1.35
Alright       1.35
Anyway        1.34
Yeah          1.33
Turns         1.23
Okay          1.22
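
The two logit lists above pair each token with a value and show the ten most negative and ten most positive entries. A minimal sketch of how such lists could be derived from a larger set of (token, logit) pairs follows. The function name `top_logit_effects` and the input format are hypothetical, and the source does not say how the values themselves were computed. Pairs are used rather than a dictionary because the same token text can appear more than once, as "Well" does here.

```python
# Hypothetical helper (not from the source): split (token, logit) pairs into
# the k most negative and the k most positive entries.
def top_logit_effects(pairs, k=10):
    """pairs: list of (token, logit) tuples.

    Returns (negative, positive), where negative is sorted from most
    negative upward and positive from most positive downward.
    """
    negative = sorted(pairs, key=lambda p: p[1])[:k]
    positive = sorted(pairs, key=lambda p: p[1], reverse=True)[:k]
    return negative, positive


# A few values copied from the listing above, in arbitrary order.
sample = [
    ("Hmm", 1.35),
    ("amas", -0.67),
    ("Well", 3.38),
    ("dash", -0.66),
    ("Well", 2.59),
    ("replacing", -0.56),
]
negative, positive = top_logit_effects(sample, k=2)
```

With this sample, `negative` starts with the "amas" entry and `positive` starts with the "Well" entry at 3.38.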
Activations Density 0.012%
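
A density figure like this is commonly read as the fraction of tokens on which the feature is active. The sketch below assumes that reading, and assumes "active" means an activation strictly above zero; the source states neither. The function name `activation_density` and the `threshold` parameter are illustrative.

```python
# Assumption (not stated in the source): density = share of tokens whose
# activation is strictly above a threshold (zero by default).
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations greater than threshold."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Illustrative data: 12 active tokens out of 100,000.
example = [0.0] * 99_988 + [1.5] * 12
density = activation_density(example)
density_text = f"{density:.3%}"
```

Under these assumptions, a density of 0.012% corresponds to the feature firing on roughly 12 of every 100,000 tokens.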