INDEX
Explanations
phrases indicating speech or communication
Negative Logits
dub             -0.14
ALAR            -0.13
çevir           -0.13
isdigit         -0.12
ugi             -0.12
translator      -0.12
edl             -0.12
.VisualBasic    -0.12
zeigen          -0.12
035             -0.12
Positive Logits
word            1.09
words           1.09
words           0.91
word            0.91
WORD            0.89
-word           0.88
Words           0.85
Word            0.82
Word            0.80
_word           0.78
Activations Density 0.317%