INDEX
    Explanations
    No Explanations Found
    Negative Logits
    " juven"          -0.89
    "interstitial"    -0.75
    "die"             -0.68
    "asar"            -0.66
    "atan"            -0.66
    " Whedon"         -0.64
    " Drawn"          -0.63
    "ailability"      -0.63
    " Costume"        -0.61
    "bane"            -0.61
    Positive Logits
    "laws"            0.73
    "bringer"         0.64
    "atcher"          0.64
    " Painter"        0.63
    " 2048"           0.60
    "rob"             0.59
    " Sonia"          0.57
    " offsets"        0.56
    " Oblivion"       0.55
    "oston"           0.54
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.