INDEX
Explanations
references to dialogue or communication involving questions and answers
Negative Logits
RTEE             -0.78
transQ           -0.71
IntoConstraints  -0.63
CURIAM           -0.60
ſind             -0.60
RTCF             -0.59
+#+#             -0.58
Biôgrafia        -0.57
kasarigan        -0.56
myſelf           -0.55
Positive Logits
hear             0.32
hears            0.31
attention        0.31
pār              0.31
vacacionales     0.30
ัญ               0.29
forças           0.29
viņ              0.29
ações            0.28
Présentation     0.28
Activations Density 0.481%