INDEX
Explanations
phrases related to catastrophic events or tragedies
the presence of the term "Se" and its various repetitions, indicating a strong focus on specific data or metrics
New Auto-Interp
Negative Logits
backwards
-0.70
bonding
-0.70
oxy
-0.68
affer
-0.68
regor
-0.65
goof
-0.65
backward
-0.64
trust
-0.63
penalty
-0.62
disrespect
-0.61
POSITIVE LOGITS
Se
3.64
Se
1.92
se
1.67
SE
1.50
Seg
1.48
Seek
1.25
Cho
1.25
Sequ
1.21
Sel
1.19
Sc
1.18
Activations Density 0.014%