INDEX
    Explanations

    Math problems

    New Auto-Interp
    Negative Logits
     quase
    -0.08
     presque
    -0.08
     bere
    -0.08
     anderer
    -0.08
     exceedingly
    -0.07
     zwe
    -0.07
     zdaj
    -0.07
     visceral
    -0.07
     panc
    -0.07
     incess
    -0.07
    POSITIVE LOGITS
     syst
    0.10
    ’ek
    0.07
     Patrol
    0.07
     Amph
    0.07
     Hollywood
    0.07
    nty
    0.07
     Countdown
    0.07
    awon
    0.07
    0.07
     Tiffany
    0.07
    Act Density 0.059%

    No Known Activations