INDEX
Explanations
inclusive and informal addresses to a group of people
Negative Logits
itſelf     -0.98
mergeFrom  -0.92
ſelf       -0.91
Anſ        -0.91
himſelf    -0.91
DockStyle  -0.90
uſe        -0.88
PMailer    -0.87
myſelf     -0.87
ftagPool   -0.86
Positive Logits
[missing]  0.58
!          0.55
here       0.51
my         0.49
,          0.49
:          0.46
.          0.46
here       0.45
my         0.45
amigos     0.44
Activations Density 0.020%