INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    recommend
    -0.06
    oretical
    -0.06
     برخورد
    -0.06
    -0.06
    section
    -0.06
     Violence
    -0.06
     bleak
    -0.06
    _properties
    -0.06
    iom
    -0.06
    ủy
    -0.06
    POSITIVE LOGITS
     Instant
    0.17
     instantly
    0.17
     instant
    0.16
    Instant
    0.14
    instant
    0.12
    .instant
    0.09
     INST
    0.09
     :"
    0.08
    XT
    0.08
    InstantiationException
    0.08
    Act Density 0.005%

    No Known Activations