INDEX
    Explanations

    references to dialogue or communication involving questions and answers

    New Auto-Interp
    Negative Logits
    RTEE
    -0.78
    transQ
    -0.71
    IntoConstraints
    -0.63
     CURIAM
    -0.60
     ſind
    -0.60
    RTCF
    -0.59
    +#+#
    -0.58
     Biôgrafia
    -0.57
     kasarigan
    -0.56
     myſelf
    -0.55
    POSITIVE LOGITS
     hear
    0.32
     hears
    0.31
     attention
    0.31
     pār
    0.31
     vacacionales
    0.30
    ัญ
    0.29
     forças
    0.29
     viņ
    0.29
     ações
    0.28
    Présentation
    0.28
    Act Density 0.481%

    No Known Activations