INDEX
Explanations
phrases related to the topic of death
New Auto-Interp
Negative Logits
"",
-0.75
".
-0.75
.";
-0.70
"/";
-0.69
'/';
-0.69
@}
-0.68
)»
-0.67
―――
-0.67
asimismo
-0.66
firent
-0.66
POSITIVE LOGITS
really
1.17
pretty
1.03
guys
1.00
REALLY
0.98
Really
0.91
really
0.90
okay
0.88
kinda
0.85
shit
0.85
haha
0.82
Activations Density 0.196%