    Explanations

    phrases related to asking about specific details or characteristics

    the word "that" in various contexts

    Negative Logits
    Token             Logit
    " Pass"           -0.59
    " stall"          -0.54
    "hat"             -0.53
    " Sad"            -0.53
    " Uk"             -0.53
    " Hall"           -0.52
    " Pace"           -0.51
    " Nay"            -0.51
    "IVERS"           -0.51
    " Jam"            -0.50
    Positive Logits
    Token             Logit
    " fateful"        0.85
    "soever"          0.83
    "eatures"         0.81
    "chers"           0.80
    " mattered"       0.77
    " surrounds"      0.77
    " resulted"       0.77
    " accompanies"    0.76
    " arose"          0.75
    " corresponds"    0.75
    Activation Density: 0.407%

    No Known Activations