INDEX
Explanations
phrases or words that can be translated from one language to another
phrases related to translation or conversion of concepts and values
New Auto-Interp
Negative Logits
Peb
-0.93
awks
-0.83
drm
-0.82
odder
-0.81
oeuv
-0.78
cot
-0.76
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.75
eper
-0.74
idem
-0.70
dra
-0.69
POSITIVE LOGITS
translate
1.30
translated
1.24
translates
1.22
translating
1.21
translations
1.09
transl
1.02
translator
1.01
translation
0.98
translation
0.91
corrid
0.91
Activations Density 0.011%