INDEX
    Explanations

    actions related to understanding and decision-making processes

    preceding questions

    knows what/how/when/where/who/if

    New Auto-Interp
    Negative Logits
    being
    -0.43
    OptionsMenu
    -0.43
    you
    -0.40
    mpi
    -0.40
    dbo
    -0.40
    phanumeric
    -0.39
    -0.39
    ノロ
    -0.39
    telegram
    -0.38
    gl
    -0.38
    POSITIVE LOGITS
     what
    1.36
     which
    1.02
     how
    0.99
     where
    0.97
     exactly
    0.95
     CreateTagHelper
    0.84
     whether
    0.81
     when
    0.80
     WHICH
    0.80
     if
    0.77
    Act Density 0.246%

    No Known Activations