INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.04
    2:0.09
    3:0.08
    4:0.09
    5:0.07
    6:0.08
    7:0.08
    8:0.10
    9:0.07
    10:0.08
    11:0.08
    Negative Logits
     fuss
    -1.94
    orum
    -1.81
     budget
    -1.74
    actionDate
    -1.66
    20439
    -1.65
    ��
    -1.63
    rieg
    -1.55
    othal
    -1.54
     throats
    -1.54
    "]=>
    -1.47
    Positive Logits
    cas
    1.53
    idelity
    1.42
    eval
    1.40
     Converted
    1.35
     Intermediate
    1.34
     poisonous
    1.34
    ocating
    1.33
     unhealthy
    1.31
     uncom
    1.31
    nova
    1.31
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.