INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     or
    0.51
     masyarakat
    0.51
    ی
    0.50
     metabolism
    0.48
     tekanan
    0.47
     downsizing
    0.47
    ::
    0.46
     in
    0.45
    *.
    0.45
     $
    0.45
    POSITIVE LOGITS
     couleurs
    0.81
     colores
    0.62
    em
    0.61
    colours
    0.59
    و
    0.58
     colours
    0.57
    色的
    0.57
     Farbe
    0.56
     Sólo
    0.55
     Farb
    0.55
    Act Density 2.918%

    No Known Activations