INDEX
Explanations
Mentions of the French language and related cultural references.
Negative Logits
extAlignment      -0.93
Hauptartikel      -0.91
Mij               -0.86
BBM               -0.84
wiada             -0.79
зыва              -0.79
awtextra          -0.79
Shetterly         -0.79
лерея             -0.78
RectangleBorder   -0.77
Positive Logits
French            1.18
France            1.16
francesa          1.06
française         1.04
França            1.04
France            1.03
francés           1.02
FRANCE            0.99
français          0.97
French            0.96
Activations Density: 0.050%