INDEX
    Explanations

    phrases indicating initial reactions or responses to various events or actions

    New Auto-Interp
    Negative Logits
    gota
    -0.07
    celik
    -0.06
    emek
    -0.06
    cek
    -0.06
    campo
    -0.06
    ationship
    -0.06
    icari
    -0.06
    اسÙħ
    -0.06
    ftware
    -0.06
    terdam
    -0.06
    POSITIVE LOGITS
    aph
    0.07
    own
    0.07
    ->{'
    0.07
    mixed
    0.07
    bage
    0.07
    rement
    0.06
    ź
    0.06
    gre
    0.06
     vol
    0.06
    zen
    0.06
    Act Density 0.006%

    No Known Activations