Explanations
dates and titles
elements related to trivia or informational content
Negative Logits
merce      -0.83
amacare    -0.79
hello      -0.74
bang       -0.73
answer     -0.72
brainer    -0.72
olding     -0.71
thouse     -0.69
EMA        -0.69
arching    -0.68
Positive Logits
Edit           1.03
Trivia         1.01
Contents       0.93
Appearances    0.93
".[            0.92
flashback      0.91
canon          0.88
Gallery        0.86
,[             0.84
edit           0.83
Activations Density 0.735%