INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.10
    1:0.05
    2:0.07
    3:0.08
    4:0.08
    5:0.08
    6:0.09
    7:0.08
    8:0.07
    9:0.07
    10:0.09
    11:0.07
    Negative Logits
    " bud"     -2.70
    " Flor"    -2.65
    " Fell"    -2.62
    "rh"       -2.60
    " Feel"    -2.52
    " Dru"     -2.52
    "hal"      -2.49
    " Rey"     -2.48
    " Iris"    -2.46
    " Quan"    -2.46
    Positive Logits
    "kefeller"    3.26
    " casinos"    3.21
    "ufact"       3.14
    "ataka"       2.91
    "nikov"       2.91
    " piston"     2.85
    " TNT"        2.83
    " Amtrak"     2.82
    "ertodd"      2.78
    " casino"     2.77
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.