INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     marquées
    -0.61
     gouttes
    -0.58
     lèvres
    -0.57
     mères
    -0.54
    følgelig
    -0.54
     skär
    -0.54
     contactez
    -0.53
    nexo
    -0.53
     réguli
    -0.53
     devenus
    -0.52
    POSITIVE LOGITS
     opinions
    0.86
    0.85
    '
    0.75
     viewpoints
    0.73
     perspectives
    0.70
     views
    0.69
     approval
    0.68
     feelings
    0.67
     opinion
    0.67
     thoughts
    0.67
    Act Density 0.001%

    No Known Activations