    Explanations

    phrases related to belief and conviction

    Negative Logits
    "ui"       -0.17
    "olla"     -0.14
    "afort"    -0.14
    "antee"    -0.14
    "quam"     -0.14
    " chez"    -0.14
    "跡"       -0.13
    "ines"     -0.13
    "iaux"     -0.13
    " Tato"    -0.13
    Positive Logits
    "ardu"             0.17
    "ordion"           0.17
    " jadx"            0.15
    " addCriterion"    0.15
    "bose"             0.14
    "nda"              0.14
    " Hlav"            0.14
    "udeau"            0.14
    "649"              0.14
    " Král"            0.14
    Act Density 0.026%

    No Known Activations