INDEX
    Explanations

    mathematical expressions and calculations

    nested structures and parentheses in the text

    New Auto-Interp
    Negative Logits
     Maced
    -0.66
     Coins
    -0.63
     Buckingham
    -0.62
     Sik
    -0.60
     Sparrow
    -0.60
     Paper
    -0.59
     Nile
    -0.59
     Wiki
    -0.59
     Ang
    -0.58
     Agriculture
    -0.57
    POSITIVE LOGITS
    ])
    1.26
    )))
    1.24
    )"
    1.23
    ))
    1.20
    >)
    1.19
    )]
    1.18
    ?)
    1.14
     ))
    1.14
    ?),
    1.13
    ")
    1.13
    Act Density 0.097%

    No Known Activations