Explanations
French words and names
Occurrences of the character "é" and its variants
Negative Logits
utterstock       -0.78
etsk             -0.77
pread            -0.77
ageddon          -0.75
INESS            -0.70
enged            -0.70
Cobra            -0.67
redistributed    -0.66
berra            -0.64
multipl          -0.64
Positive Logits
lé        1.18
Dame      0.89
é         0.88
vez       0.87
ré        0.86
rez       0.83
ité       0.82
ration    0.82
mie       0.80
cé        0.79
Activations Density: 0.022%
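
As a rough illustration of how a density figure like this can be read, the sketch below assumes it is the fraction of sampled tokens on which the feature's activation is above zero. The function name `activation_density`, the zero threshold, and the sample (22 active tokens out of 100,000) are all hypothetical; the sample was chosen only to reproduce the same percentage and is not data from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all; the dashboard's exact definition is not stated.
    """
    if len(activations) == 0:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, of which 22 activate the feature.
sample = [0.0] * 99_978 + [1.0] * 22
density = activation_density(sample)
print(f"{density:.3%}")  # 0.022%
```

Under that reading, a density of 0.022% means the feature fires on roughly 2 of every 10,000 tokens, which fits a feature tied to a narrow pattern such as "é" in French words.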