INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    орош
    -0.07
     vectors
    -0.07
     Chains
    -0.07
     додатков
    -0.06
    _testing
    -0.06
    _repo
    -0.06
    _BINARY
    -0.06
    _face
    -0.06
    _topic
    -0.06
    ut
    -0.06
    POSITIVE LOGITS
    resultado
    0.07
    (\'
    0.06
    panel
    0.06
    ellan
    0.06
    ôt
    0.06
     %#
    0.06
     ah
    0.06
     Địa
    0.06
     ethnicity
    0.06
     //*
    0.06
    Act Density 0.006%

    No Known Activations