INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     því
    -0.10
    _callable
    -0.09
     ზრდ
    -0.08
     فرو
    -0.08
     collectibles
    -0.08
     کودک
    -0.08
     ასევე
    -0.08
     کف
    -0.08
     حي
    -0.08
    өдөл
    -0.08
    POSITIVE LOGITS
    bay
    0.08
     approximation
    0.08
     Braun
    0.07
    -profile
    0.07
     bay
    0.07
     montagnes
    0.07
    0.07
     começar
    0.07
     approxim
    0.07
    ações
    0.07
    Act Density 0.002%

    No Known Activations