INDEX
    Explanations

    variable and equation representations in mathematical contexts

    New Auto-Interp
    Negative Logits
    >>)
    -0.15
    lag
    -0.15
    ');↵
    -0.14
    ,:);↵
    -0.14
    ocale
    -0.14
     GOODMAN
    -0.14
    LAG
    -0.14
    tam
    -0.14
    velop
    -0.13
    aye
    -0.13
    POSITIVE LOGITS
    )]
    0.29
    )}
    0.21
    )]↵
    0.20
    }}
    0.19
    )];
    0.18
    )}"↵
    0.18
    .")]↵
    0.18
    ]]
    0.18
     )]↵
    0.18
    ']}↵
    0.17
    Act Density 0.133%

    No Known Activations