INDEX
    Explanations

    Paulo Freire

    NEGATIVE LOGITS
    Token             Logit
    ">S"              -0.08
    "<li"             -0.08
    "Laugh"           -0.08
    "numerusform"     -0.07
    " apr"            -0.07
    " Valentine's"    -0.07
    "apr"             -0.07
    " comedian"       -0.07
    " חלק"            -0.07
    "ਾਜ"              -0.07
    POSITIVE LOGITS
    Token             Logit
    " cauliflower"    0.09
    " reeks"          0.08
    " dagger"         0.08
    "Tha"             0.08
    " wacht"          0.08
    "ZG"              0.07
    " ress"           0.07
    " wyth"           0.07
    " Basel"          0.07
    "own"             0.07
    Activation Density: 0.000%

    No Known Activations