INDEX
Explanations
references to France and its cultural or historical context
Negative Logits
Hauptartikel    -0.84
Shetterly       -0.83
curio           -0.77
öhn             -0.77
poffe           -0.75
Kuli            -0.75
Mij             -0.73
expandindo      -0.73
KTP             -0.73
CURI            -0.71
Positive Logits
y               0.91
française       0.85
French          0.84
francesa        0.84
France          0.83
Pogba           0.80
francés         0.78
französischen   0.77
France          0.76
tır             0.75
Activations Density: 0.011%