INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.06
    1:0.04
    2:0.09
    3:0.09
    4:0.09
    5:0.06
    6:0.08
    7:0.11
    8:0.07
    9:0.07
    10:0.08
    11:0.09
    Negative Logits
    clinton
    -1.73
     favour
    -1.50
    Romney
    -1.45
     convinc
    -1.42
     refere
    -1.41
    ellen
    -1.39
     unconsciously
    -1.38
    ceived
    -1.37
     overest
    -1.36
    ��
    -1.36
    POSITIVE LOGITS
     Slug
    1.91
     Sanctuary
    1.46
     Prohibition
    1.41
     Varg
    1.37
    andise
    1.35
    nsics
    1.35
     wal
    1.35
     Lucius
    1.34
    ibal
    1.34
     Martial
    1.34
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.