INDEX
Explanations
references to drinking, particularly alcoholic beverages
New Auto-Interp
Negative Logits
sieur
-0.98
Lans
-0.96
olesale
-0.92
nourriture
-0.91
Ubi
-0.91
Réponses
-0.90
揄
-0.88
urethane
-0.84
propylene
-0.83
tershire
-0.83
POSITIVE LOGITS
rin
1.01
gin
0.98
tin
0.91
Perrin
0.91
in
0.90
IN
0.89
lin
0.89
Dina
0.88
Fin
0.86
gin
0.84
Activations Density 2.891%