INDEX
Explanations
references to French culture or items associated with France
New Auto-Interp
Negative Logits
emean
-0.17
atrice
-0.15
lift
-0.15
aidu
-0.15
obs
-0.14
ìĭľìĺ¤
-0.14
ennes
-0.14
eut
-0.14
topics
-0.14
è³
-0.14
POSITIVE LOGITS
æĭ
0.15
migrationBuilder
0.15
spe
0.14
Ư
0.14
iores
0.14
åĭĴ
0.14
anity
0.13
IRO
0.13
filer
0.13
оÑģÑĤав
0.13
Activations Density 0.008%