INDEX
    Explanations

    sentences involving confirmations, assertions, and notable statements made by individuals

    Negative Logits
    ezier    -0.17
    arget    -0.16
    aspers   -0.15
    аков     -0.15
    erp      -0.15
    à¥įतव   -0.15
    lique    -0.14
    elle     -0.14
     Maul    -0.14
    еÑĢк    -0.14
    Positive Logits
     also    0.17
    Also     0.16
     Also    0.15
     Hind    0.15
     juga    0.15
     ALSO    0.14
     hope    0.14
     pyl     0.14
    esini    0.14
    EEK      0.13
    Activation Density 0.057%

    No Known Activations