INDEX
Explanations
text that mention translations or translated content
references to translations of texts or media
New Auto-Interp
Negative Logits
ndra
-1.01
achine
-0.76
Peb
-0.75
pload
-0.73
Gamble
-0.72
reditary
-0.72
resy
-0.70
dinand
-0.70
\/\/
-0.70
pton
-0.70
POSITIVE LOGITS
translation
1.03
translations
1.02
translator
0.94
transl
0.92
translated
0.91
translation
0.89
into
0.86
subtitles
0.85
translates
0.83
language
0.81
Activations Density 0.039%