INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Stores
    -0.76
     transfers
    -0.72
     Luis
    -0.72
     slipped
    -0.68
     да
    -0.67
    вод
    -0.66
     transferred
    -0.65
    complish
    -0.65
    дох
    -0.65
     ต
    -0.65
    POSITIVE LOGITS
    Rap
    1.55
     Rap
    1.21
     rap
    1.20
     rapping
    1.18
    rap
    1.09
     rapp
    1.03
    RAP
    1.03
     Rapp
    1.02
    urous
    0.96
     raps
    0.96
    Act Density 0.024%

    No Known Activations