INDEX
Negative Logits
AndEndTag
-0.82
IsContent
-0.80
ſeveral
-0.78
Jefus
-0.77
новниш
-0.77
Monfieur
-0.75
تفصیلات
-0.74
myſelf
-0.74
uſed
-0.74
uſe
-0.74
POSITIVE LOGITS
image
0.58
an
0.52
ph
0.52
images
0.49
app
0.49
像
0.49
mental
0.48
inverted
0.47
virtual
0.47
e
0.43
Activations Density 0.004%