INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     but
    -0.20
     Therefore
    -0.18
     so
    -0.18
     therefore
    -0.17
    onga
    -0.16
     wiÄĻc
    -0.16
     So
    -0.16
     pero
    -0.15
     BUT
    -0.15
    Therefore
    -0.15
    POSITIVE LOGITS
     Answers
    0.31
     answers
    0.30
     Answer
    0.29
     ANSW
    0.29
     answering
    0.28
    çŃĶæ¡Ī
    0.28
    Answers
    0.27
     answer
    0.27
     answered
    0.26
     çŃĶ
    0.24
    Act Density 0.073%

    No Known Activations