    Explanations

    instances of introductory phrases typically used in mathematical or logical statements

    Negative Logits
     Houſe      -0.74
     Grecian    -0.74
    wiſe        -0.73
    ſelf        -0.72
     himſelf    -0.72
     nargin     -0.72
     للاسماء    -0.71
    >')         -0.70
    ;");        -0.69
    ^(@)        -0.68

    Positive Logits
     Let        1.68
    Let         1.63
     LET        1.35
     let        1.15
    LET         0.95
    let         0.94
    Lets        0.84
    Пусть       0.78
     Lets       0.75
     Пусть      0.74

    Activation Density: 0.184%

    No Known Activations