INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ufact
    -0.96
    rites
    -0.81
     Tycoon
    -0.71
    ensions
    -0.69
    orpor
    -0.67
    gradation
    -0.66
    ortunately
    -0.66
    ected
    -0.66
    alty
    -0.64
    ©¶æ¥µ
    -0.64
    POSITIVE LOGITS
    naires
    1.51
    naire
    1.35
     answered
    1.23
    answered
    1.04
     posed
    1.02
     questions
    1.02
     asked
    1.00
     answ
    0.99
    Answer
    0.97
    answer
    0.96
    Act Density 0.088%

    No Known Activations