INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     VAL
    -0.80
     insulator
    -0.77
     Val
    -0.75
    -0.69
     પ્ર
    -0.69
    Dto
    -0.68
     insu
    -0.66
     Dub
    -0.65
     proced
    -0.65
     OCD
    -0.65
    POSITIVE LOGITS
     иностранных
    0.78
    ած
    0.73
    DEAD
    0.70
    laki
    0.68
     tangki
    0.67
    itahu
    0.67
    entreprises
    0.66
    0.66
     Elimination
    0.66
    تفسير
    0.66
    Act Density 0.097%

    No Known Activations