INDEX
Explanations
words related to shame or shameful actions
New Auto-Interp
Negative Logits
Ajax
-0.85
Tone
-0.73
Leth
-0.67
OLOGY
-0.66
Reach
-0.66
Luther
-0.64
unfocusedRange
-0.63
anwhile
-0.62
ground
-0.61
ãĥ¼ãĥĨãĤ£
-0.60
POSITIVE LOGITS
apesh
1.29
atters
1.29
rapnel
1.24
apeshifter
1.22
oddy
1.20
aded
1.19
aders
1.17
attering
1.15
ackle
1.13
ippers
1.13
Activations Density 0.018%