INDEX
    Explanations

    expressions of skepticism and challenges in decision-making contexts

    New Auto-Interp
    Negative Logits
    dera
    -0.16
    ittel
    -0.16
    orz
    -0.15
     Second
    -0.15
    ilan
    -0.15
     Recently
    -0.14
    imat
    -0.14
    edl
    -0.13
     recent
    -0.13
     either
    -0.13
    POSITIVE LOGITS
     initially
    1.19
     initial
    1.03
    initial
    1.00
     Initially
    0.97
    Initially
    0.91
     Initial
    0.85
     inicial
    0.84
    Initial
    0.82
    æľĢåĪĿ
    0.79
    _initial
    0.75
    Act Density 0.459%

    No Known Activations