INDEX
    Explanations

    math-related terms

    words and phrases that convey personal experiences or opinions

    New Auto-Interp
    Negative Logits
     streng
    -0.53
    ãĥ¼ãĥĨãĤ£
    -0.52
     elig
    -0.45
    ãĥīãĥ©
    -0.44
     concess
    -0.43
     perspect
    -0.43
     referen
    -0.43
     restrictive
    -0.42
     stringent
    -0.42
     predec
    -0.42
    POSITIVE LOGITS
    .",
    1.00
    .")
    1.00
    .,"
    0.99
    ,"
    0.98
    ."[
    0.97
    ,''
    0.96
    ."
    0.94
    .[
    0.93
    ".[
    0.92
    .),
    0.92
    Act Density 1.018%

    No Known Activations