INDEX
Explanations
a mix of monetary references, dates, and common prefixes/suffixes in words.
French words
New Auto-Interp
Negative Logits
#![
-0.37
pym
-0.36
lwz
-0.35
Marius
-0.35
+:+
-0.35
mips
-0.35
USART
-0.35
nero
-0.34
gql
-0.34
communis
-0.33
POSITIVE LOGITS
hâte
0.84
démission
0.84
dépens
0.83
dégâts
0.79
rêves
0.78
écl
0.78
égard
0.77
deuil
0.74
autorité
0.74
larmes
0.73
Activations Density 19.347%