INDEX
Explanations
references to actions and assertions made by individuals
New Auto-Interp
Negative Logits
оригіналу
-0.70
doesn
-0.57
Do
-0.56
dos
-0.56
don
-0.54
ıştı
-0.53
gráficos
-0.53
DON
-0.53
jspx
-0.52
DO
-0.50
POSITIVE LOGITS
did
2.64
did
1.92
Did
1.85
Did
1.84
DID
1.28
DID
1.09
didst
0.91
gjorde
0.86
done
0.85
hicieron
0.72
Activations Density 0.373%