INDEX
Explanations
strong emotional expressions or judgments in narratives
New Auto-Interp
Negative Logits
atrix
-0.16
dispatch
-0.14
ange
-0.14
.ndim
-0.14
eral
-0.13
ilda
-0.13
ions
-0.13
loss
-0.13
gem
-0.13
stars
-0.13
POSITIVE LOGITS
this
0.16
setLayout
0.16
Panel
0.15
deo
0.15
weis
0.15
opers
0.14
PANEL
0.14
iese
0.14
Panels
0.14
orgia
0.13
Activations Density 0.007%