INDEX
    Explanations

    terms related to qualifications and conditions for actions or events

    Negative Logits
    Token        Logit
    " Con"       -0.28
    "Con"        -0.17
    "-Con"       -0.17
    " conjug"    -0.16
    "šit"        -0.15
    " Thu"       -0.15
    "chu"        -0.15
    " Cone"      -0.15
    "ufs"        -0.14
    "ube"        -0.14
    Positive Logits
    Token        Logit
    "icon"       0.20
    "on"         0.19
    "eon"        0.19
    "cons"       0.18
    "icons"      0.18
    "-k"         0.18
    " کان"       0.18
    "oon"        0.18
    "-cons"      0.17
    "建設"       0.17
    Activation Density: 0.062%

    No Known Activations