INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.09
    1:0.08
    2:0.08
    3:0.08
    4:0.09
    5:0.09
    6:0.06
    7:0.08
    8:0.09
    9:0.06
    10:0.08
    11:0.07
    Negative Logits
    "eatures"            -2.13
    "gaard"              -2.10
    " withd"             -1.89
    (token not shown)    -1.88
    "properties"         -1.84
    "compl"              -1.83
    "stood"              -1.81
    "ohl"                -1.77
    "iband"              -1.72
    "ancies"             -1.71
    Positive Logits
    " enslaved"          1.83
    " Negro"             1.76
    " circus"            1.75
    "emic"               1.74
    " Invasion"          1.74
    " rob"               1.73
    (token not shown)    1.68
    " threatening"       1.68
    (token not shown)    1.66
    " Mafia"             1.65
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.