INDEX
Negative Logits
oundary
-0.14
æ³³
-0.14
Ìģ
-0.14
ToObject
-0.14
utenberg
-0.13
OOSE
-0.13
573
-0.13
,},↵
-0.13
uya
-0.13
æŃ
-0.13
POSITIVE LOGITS
fuck
0.17
Fuck
0.17
fuck
0.17
Fucked
0.16
itou
0.15
shit
0.14
hell
0.14
fucking
0.14
,
0.14
fucked
0.14
Activations Density 0.000%