    Explanations

    coordinate system transformations

    Negative Logits
    "itrust"             -0.08
    "_equal"             -0.08
    " સાચ"               -0.08
    "ellect"             -0.08
    " judgment"          -0.08
    " kiểm"              -0.08
    " deserving"         -0.07
    ".equal"             -0.07
    " провер"            -0.07
    " discrimination"    -0.07
    Positive Logits
    " selber"            0.08
    " Galleries"         0.08
    "chelles"            0.08
    " Self"              0.08
    " फ्ल"                0.08
    " selbst"            0.08
    "#."                 0.07
    " Coordinates"       0.07
    " Paj"               0.07
    " doll"              0.07
    Act Density 0.008%

    No Known Activations