INDEX
    Explanations

    expressions of decision-making and resolutions

    New Auto-Interp
    Negative Logits
    ipa
    -0.15
    ichel
    -0.14
    inati
    -0.14
    Äįer
    -0.14
    ван
    -0.14
    hoa
    -0.13
    aná
    -0.13
    iev
    -0.13
    larım
    -0.13
    ateg
    -0.13
    POSITIVE LOGITS
    against
    0.17
     instead
    0.16
     Against
    0.16
     лÑĥÑĩ
    0.16
    instead
    0.16
     against
    0.16
    rather
    0.16
    Against
    0.16
    Instead
    0.15
     skoro
    0.15
    Act Density 0.026%

    No Known Activations