    Explanations
    No Explanations Found
    Negative Logits
    " respons"       -0.80
    " taxed"         -0.71
    " indu"          -0.68
    "slave"          -0.66
    " presses"       -0.65
    "forces"         -0.65
    "iasco"          -0.64
    " harassed"      -0.64
    " symp"          -0.64
    " persecuted"    -0.63
    Positive Logits
    " Built"          0.70
    "Writing"         0.69
    "â̲"               0.67
    " Writing"        0.66
    " Spaces"         0.65
    " Ago"            0.65
    " Achievements"   0.64
    "alling"          0.63
    " Fast"           0.63
    "å"               0.63
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.