INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    velt
    -0.87
    ort
    -0.78
    vati
    -0.73
    intent
    -0.72
     Osw
    -0.70
    leans
    -0.68
     skelet
    -0.66
    rising
    -0.66
    apore
    -0.66
    resso
    -0.65
    POSITIVE LOGITS
     silence
    0.67
    _>
    0.67
    =(
    0.66
    none
    0.66
     Edition
    0.65
     radius
    0.60
     incomes
    0.60
     impunity
    0.59
    ²¾
    0.59
     indist
    0.59
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.