INDEX
Explanations
written conversations with a colon to indicate a speaker
punctuation marks or colons at the end of sentences or phrases
Negative Logits
behavi        -0.80
avorite       -0.72
rule          -0.72
caster        -0.69
depreciation  -0.68
poons         -0.68
respective    -0.66
reckoning     -0.66
overl         -0.64
inement       -0.64
Positive Logits
Yeah          1.00
Bye           0.88
Who           0.82
Yes           0.81
Oh            0.81
TBD           0.80
Impossible    0.77
Huh           0.75
Okay          0.75
Yep           0.74
Activations Density 0.086%