INDEX
Negative Logits
negoci
0.44
garbage
0.44
quies
0.44
chargés
0.44
satisf
0.43
lifetimes
0.43
extremes
0.43
score
0.43
binaries
0.42
⊔
0.42
POSITIVE LOGITS
img
0.55
iframe
0.50
align
0.49
allery
0.48
italics
0.47
ah
0.47
italic
0.47
oljš
0.46
ally
0.46
'><
0.46
Activations Density 0.013%