INDEX
Explanations
mentions of languages
mentions of languages
New Auto-Interp
Negative Logits
xus
-0.92
lier
-0.91
igham
-0.87
urion
-0.87
vre
-0.85
rons
-0.80
ldon
-0.79
llan
-0.78
rences
-0.78
apego
-0.77
POSITIVE LOGITS
translation
1.06
pronunciation
1.03
diction
0.97
language
0.97
translations
0.90
languages
0.90
transl
0.89
Nadu
0.88
accents
0.87
transcription
0.87
Activations Density 0.085%