INDEX
    Explanations

    multiple languages

    Negative Logits
    Logit   Token
    -0.08   '."<'
    -0.08   ' uncle'
    -0.08   ' cust'
    -0.08   'abs'
    -0.08   'воль'
    -0.08   'ply'
    -0.07   ' desap'
    -0.07   'ваў'
    -0.07   'ाले'
    -0.07   'جيب'
    Positive Logits
    Logit   Token
    0.08    '.Cast'
    0.08    '_USAGE'
    0.08
    0.07    ' επι'
    0.07    '.Keyword'
    0.07    ' ситуация'
    0.07    ' Situation'
    0.07    ' Beginner'
    0.07    ' Lik'
    0.07    ' до'
    Activation Density: 0.012%

    No Known Activations