INDEX
Negative Logits
i
0.83
b
0.71
at
0.70
h
0.68
g
0.64
f
0.60
3
0.60
d
0.56
a
0.55
}.
0.52
POSITIVE LOGITS
It
0.84
</h3>
0.71
л
0.71
</h2>
0.67
it
0.63
</b>
0.63
</i>
0.59
ъ
0.57
</h6>
0.55
ü
0.52
Activations Density 0.000%
i
b
at
h
g
f
3
d
a
}.
It
</h3>
л
</h2>
it
</b>
</i>
ъ
</h6>
ü