INDEX
Explanations
occurrences of the French pronouns and their variations
New Auto-Interp
Negative Logits
trash
-0.16
488
-0.15
738
-0.15
teness
-0.15
еÑĦ
-0.14
erts
-0.14
одо
-0.14
asser
-0.14
pus
-0.13
onso
-0.13
POSITIVE LOGITS
ATAL
0.15
Ä¢
0.15
íķĢ
0.15
wen
0.14
Ø´ÙĬ
0.14
мом
0.14
ENTS
0.14
PROCUREMENT
0.14
elden
0.14
ắng
0.13
Activations Density 0.009%