    Negative Logits
    Token             Logit
    "sover"           -0.09
    " Elementary"     -0.08
    " Gro"            -0.08
    "gra"             -0.07
    (missing)         -0.07
    "atility"         -0.07
    "Elementary"      -0.07
    "trust"           -0.07
    " IBM"            -0.07
    "-associated"     -0.07
    Positive Logits
    Token             Logit
    " Perspective"    0.08
    " tilted"         0.08
    "Perspective"     0.08
    " imprint"        0.08
    " verso"          0.08
    "Intent"          0.08
    "idhe"            0.08
    (missing)         0.08
    " vérité"         0.08
    "untas"           0.08
    Activation Density: 0.003%

    No Known Activations