INDEX
Explanations
mentions of specific names or titles, particularly those related to the "Bun" and "Kun" categories
New Auto-Interp
Negative Logits
itſelf
-0.87
Astra
-0.84
OCCURRED
-0.84
bebasan
-0.83
Sapp
-0.82
Jefus
-0.81
➤
-0.80
corporativa
-0.79
myſelf
-0.79
houſe
-0.79
POSITIVE LOGITS
Bun
1.20
Bun
1.15
bun
1.09
bun
1.06
BUN
1.06
Cun
1.05
KUN
1.02
BUN
1.01
JUN
1.00
KUN
1.00
Activations Density 0.081%