INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.07
    4:0.07
    5:0.08
    6:0.10
    7:0.08
    8:0.08
    9:0.08
    10:0.07
    11:0.07
    Negative Logits
    "changes"          -1.87
    " Andersen"        -1.68
    " Meyer"           -1.61
    " Lerner"          -1.58
    (no token shown)   -1.54
    " Dare"            -1.52
    " Framework"       -1.52
    "Fram"             -1.52
    " Zup"             -1.50
    " Doe"             -1.50
    Positive Logits
    "nikov"          1.97
    "ecause"         1.92
    "ategory"        1.86
    "vertisement"    1.84
    "arcity"         1.80
    "ividual"        1.79
    "obbies"         1.73
    "aughs"          1.71
    "tainment"       1.70
    "senal"          1.70
    Act Density 0.000%

    This feature has no known activations.