INDEX
Explanations
phrases related to music and entertainment
topics related to media and entertainment, particularly music and film
New Auto-Interp
Negative Logits
)",
-0.95
]);
-0.85
]),
-0.85
));
-0.83
)]
-0.78
?",
-0.78
"),
-0.78
'),
-0.78
])
-0.78
),
-0.77
POSITIVE LOGITS
.
1.02
.?
0.84
.#
0.76
._
0.71
_.
0.70
.>>
0.69
*.
0.66
./
0.65
/.
0.64
shit
0.63
Activations Density 0.704%