INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     brill
    -0.81
    sonian
    -0.77
    lov
    -0.74
     communism
    -0.72
    cule
    -0.69
     Communism
    -0.68
    aque
    -0.68
     Chavez
    -0.68
     toxin
    -0.65
    lys
    -0.63
    POSITIVE LOGITS
    ows
    1.25
    owed
    0.76
    ãĥł
    0.74
    OWS
    0.67
    OW
    0.67
    xtap
    0.67
    orted
    0.66
    ategories
    0.66
    ower
    0.66
    owing
    0.66
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.