Negative Logits
terce      -0.08
cx         -0.08
Tl         -0.08
isasi      -0.07
Td         -0.07
stories    -0.07
honey      -0.07
bu         -0.07
barang     -0.07
hver       -0.07
Positive Logits
entitled    0.11
titled      0.10
/report     0.09
-worthy     0.09
contender   0.08
intitul     0.08
件          0.08
pendiente   0.08
/book       0.08
menée       0.08
Activations Density 0.011%