INDEX
Explanations
the presence of punctuation, specifically periods, indicating the end of sentences
New Auto-Interp
Negative Logits
vaux
-0.68
zha
-0.68
InputModule
-0.64
babwe
-0.60
InputDecoration
-0.60
GenerationType
-0.59
IRECT
-0.58
稲田
-0.57
cheidet
-0.57
metheus
-0.56
POSITIVE LOGITS
.)
2.00
.)
1.54
,)
1.54
.)}
1.52
.]
1.49
。)
1.49
.))
1.48
.”)
1.47
].)
1.42
.")
1.32
Activations Density 0.297%