INDEX
    Explanations

    questions or statements that start with "what" or "why" and relate to reasoning or opinions

    New Auto-Interp
    Negative Logits
    memoized
    -0.71
    IndentedString
    -0.59
    Clik
    -0.54
     Мексичка
    -0.54
    hilangan
    -0.53
    expandindo
    -0.53
    Portály
    -0.51
     δὲ
    -0.51
    BeginContext
    -0.50
    enterOuterAlt
    -0.49
    POSITIVE LOGITS
     للمعارف
    0.61
    DoubleQuotes
    0.60
    satunya
    0.59
     why
    0.58
     السب
    0.56
     właśnie
    0.54
     reason
    0.51
     فريبيس
    0.51
    ziplin
    0.51
     Einbau
    0.50
    Act Density 0.318%

    No Known Activations