INDEX
Explanations
direct speech marked with quotations
dialogue punctuation, particularly the use of quotation marks
New Auto-Interp
Negative Logits
avorite
-0.89
irtual
-0.71
¥ŀ
-0.69
vide
-0.68
Lauder
-0.67
deterrent
-0.65
satell
-0.65
phthal
-0.64
triv
-0.64
subsid
-0.62
POSITIVE LOGITS
Oh
1.01
Hey
0.99
hey
0.94
Sir
0.88
Yo
0.85
I
0.84
cause
0.83
Everybody
0.83
Wait
0.81
Let
0.80
Activations Density 0.087%