INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.09
    1:0.08
    2:0.08
    3:0.09
    4:0.07
    5:0.09
    6:0.07
    7:0.09
    8:0.08
    9:0.08
    10:0.08
    11:0.07
    Negative Logits
     Wagner
    -2.85
     exhibitions
    -2.77
     Kyoto
    -2.63
     Tsuk
    -2.51
    nai
    -2.48
     opera
    -2.37
     pleasures
    -2.36
     Bore
    -2.33
     postwar
    -2.33
     Vald
    -2.31
    POSITIVE LOGITS
    ichick
    2.68
     Taliban
    2.64
    FIR
    2.59
    iphate
    2.59
    hari
    2.59
    Phones
    2.38
     Explain
    2.34
    confirmed
    2.34
    Islamic
    2.31
    #$
    2.28
    Act Density 0.000%

    No Known Activations