INDEX
Explanations
elements related to creativity and storytelling
Negative Logits
wl         -0.15
auen       -0.15
onor       -0.15
Tau        -0.14
unal       -0.14
bash       -0.14
illard     -0.14
Barnett    -0.14
aptors     -0.14
rado       -0.14
Positive Logits
behind      1.06
Behind      0.86
beh         0.80
Behind      0.78
_beh        0.63
beh         0.55
achter      0.53
.beh        0.52
-be         0.50
underlying  0.50
Activations Density 0.215%