INDEX
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.08
    4:0.08
    5:0.09
    6:0.06
    7:0.09
    8:0.09
    9:0.07
    10:0.08
    11:0.09
    Negative Logits
    nown:-3.21
    [token missing]:-3.17
    [token missing]:-3.15
    [token missing]:-3.13
    覚醒:-3.11
    ipal:-3.05
    [undecodable token]:-3.04
    irgin:-3.04
    [token missing]:-3.03
    [token missing]:-2.98
    Positive Logits
     exhibitions:2.71
     integral:2.63
     fluid:2.57
     vital:2.51
     Parliament:2.50
     bout:2.49
     intricate:2.49
     retrospective:2.49
     critically:2.48
     applied:2.46
    Act Density 0.000%

    No Known Activations