INDEX
    Explanations

    phrases indicating knowledge or expertise in various subjects

    claims of knowledge or understanding about various subjects

    New Auto-Interp
    Negative Logits
     Featured
    -0.72
    hement
    -0.70
    atform
    -0.70
    cation
    -0.68
    yrim
    -0.66
    erate
    -0.65
    ittal
    -0.65
    ission
    -0.64
    vertisement
    -0.63
    eworthy
    -0.63
    POSITIVE LOGITS
     intimately
    0.88
     whereabouts
    0.83
     firsthand
    0.81
    CHAT
    0.77
     beforehand
    0.70
    âĨij
    0.69
     drill
    0.68
     instinctively
    0.68
     secret
    0.67
    æĿ
    0.67
    Act Density 0.214%

    No Known Activations