    Explanations
    No Explanations Found
    Negative Logits
    Token         Logit
    " Rai"        -0.78
    "lehem"       -0.70
    "igun"        -0.70
    " corrid"     -0.69
    " enthusi"    -0.69
    "ppo"         -0.68
    "zbollah"     -0.68
    "zona"        -0.67
    "ombo"        -0.67
    " bom"        -0.65
    Positive Logits
    Token         Logit
    "owed"        0.83
    "hold"        0.68
    "backs"       0.66
    "igned"       0.62
    " Furn"       0.61
    " Dull"       0.61
    "iors"        0.60
    "aken"        0.60
    "200000"      0.59
    "atical"      0.58
    Activation Density: 0.000%

    No Known Activations

    This feature has no known activations.