INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.07
    2:0.08
    3:0.09
    4:0.07
    5:0.07
    6:0.09
    7:0.06
    8:0.09
    9:0.07
    10:0.09
    11:0.07
    Negative Logits
     Dota
    -3.01
    Frames
    -2.99
     Overwatch
    -2.89
     StarCraft
    -2.74
     Starcraft
    -2.73
     Arkham
    -2.69
     dystopian
    -2.63
    sych
    -2.60
     cereal
    -2.56
    Dream
    -2.56
    POSITIVE LOGITS
    nels
    3.16
    nc
    2.95
    tn
    2.74
    ..............
    2.69
    nel
    2.60
    psey
    2.57
    unt
    2.55
    vt
    2.54
    arse
    2.48
    INC
    2.48
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.