INDEX
    Explanations

    questions and statements in a text

    New Auto-Interp
    Negative Logits
    ufact
    -0.81
    photos
    -0.71
    olia
    -0.67
    robe
    -0.66
    ilon
    -0.65
    natureconservancy
    -0.63
    Lago
    -0.62
    ool
    -0.61
     Sigma
    -0.61
    Weapons
    -0.61
    POSITIVE LOGITS
     answered
    1.57
    answered
    1.48
     unanswered
    1.45
    answer
    1.44
     answering
    1.29
    Answer
    1.29
     answers
    1.28
     answ
    1.26
     asked
    1.19
     Answers
    1.13
    Act Density 0.190%

    No Known Activations