INDEX
Negative Logits
varandra
-0.89
enfans
-0.85
zelve
-0.83
itſelf
-0.80
avoient
-0.76
voisins
-0.69
themſelves
-0.69
pauvres
-0.68
auroit
-0.66
alluminio
-0.66
POSITIVE LOGITS
"}>
0.54
')['
0.54
});*/
0.53
})*/
0.52
?>">
0.51
]}>
0.51
=!
0.50
.');
0.50
stand
0.50
()}>
0.49
Activations Density 0.025%