INDEX
Explanations
creative expressions and the processes behind artistic works
New Auto-Interp
Negative Logits
Tau
-0.15
wl
-0.14
ewood
-0.14
onor
-0.14
bots
-0.14
Barnett
-0.13
treff
-0.13
Bowling
-0.13
pert
-0.13
aeda
-0.13
POSITIVE LOGITS
behind
0.94
Behind
0.76
beh
0.73
Behind
0.68
_beh
0.60
beh
0.54
.beh
0.50
underlying
0.48
achter
0.47
-be
0.44
Activations Density 0.190%