INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     ſche
    -0.75
    ScopeManager
    -0.64
    ſelves
    -0.60
     Favre
    -0.60
     pleaſure
    -0.60
    typeparam
    -0.59
     deleteUser
    -0.59
    pthread
    -0.58
     pthread
    -0.57
    Nathalie
    -0.57
    POSITIVE LOGITS
     is
    1.30
     Is
    0.99
    is
    0.97
    Is
    0.94
     are
    0.79
     IS
    0.77
     has
    0.71
     was
    0.63
    0.59
     è
    0.58
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.