NEGATIVE LOGITS
feroit        -0.78
itſelf        -0.78
sauvages      -0.76
enfans        -0.75
antMatchers   -0.74
vœux          -0.74
houſe         -0.74
auroit        -0.74
raiſ          -0.73
étoit         -0.72
POSITIVE LOGITS
s             0.78
NE            0.56
ся            0.54
ento          0.54
F             0.53
na            0.53
site          0.53
ne            0.52
grand         0.52
</strong>     0.51
Activations Density 0.078%