    Negative Logits
     resourceCulture   -0.98
     propOrder         -0.87
     oreilles          -0.72
     chré              -0.71
     consommateurs     -0.71
     comportements     -0.71
     stället           -0.69
     indépendante      -0.68
     blessures         -0.66
     religieuses       -0.66
    Positive Logits
    ly       0.77
    ment     0.71
    board    0.71
    ry       0.70
    ous      0.68
    let      0.66
    ful      0.65
    e        0.65
    xic      0.63
    tin      0.63
    Activation Density: 0.093%

    No Known Activations