INDEX
Explanations
mentions of nobility or noble characteristics
New Auto-Interp
Negative Logits
itte
-0.18
lage
-0.16
ASE
-0.15
ìĪł
-0.14
tele
-0.14
forder
-0.14
İT
-0.14
Tele
-0.14
ENER
-0.13
Penalty
-0.13
POSITIVE LOGITS
rett
0.18
getti
0.16
lett
0.15
âħ
0.14
olini
0.14
.documentation
0.14
575
0.14
nob
0.14
marshal
0.14
áci
0.14
Activations Density 0.015%