INDEX
    Explanations

    Measurements

    New Auto-Interp
    Negative Logits
    .coll
    -0.07
    .pattern
    -0.06
     replic
    -0.06
     burst
    -0.06
    ami
    -0.06
    -0.06
    literal
    -0.06
     corpus
    -0.06
     Stones
    -0.06
    -left
    -0.06
    POSITIVE LOGITS
     будинку
    0.07
    ))]
    0.06
     nir
    0.06
    __,__
    0.06
    0.06
    0.06
    (usuario
    0.06
    ="../../../
    0.06
     Complaint
    0.06
    0.06
    Act Density 0.037%

    No Known Activations