INDEX
    Explanations
    No Explanations Found
    Negative Logits
    "eton"      -0.82
    "odox"      -0.76
    "isks"      -0.75
    "isky"      -0.75
    "SHIP"      -0.72
    "acular"    -0.72
    "IUM"       -0.71
    "arium"     -0.71
    "unn"       -0.69
    "inav"      -0.68
    Positive Logits
    " folds"       0.62
    " collagen"    0.59
    " gelatin"     0.59
    " drawer"      0.58
    " metab"       0.58
    " millisec"    0.58
    "genic"        0.57
    " fused"       0.57
    " vein"        0.55
    "atel"         0.55
    Activation Density: 0.000%

    No Known Activations

    This feature has no known activations.