INDEX
    Explanations

    instances of special characters and formatting tokens in the text

    New Auto-Interp
    Negative Logits
     samar
    -0.65
    lesssim
    -0.64
     ne
    -0.63
     bra
    -0.63
     model
    -0.62
     ser
    -0.62
    elry
    -0.62
     front
    -0.61
     ang
    -0.61
     Levin
    -0.60
    POSITIVE LOGITS
     varandra
    0.88
    enumi
    0.85
     bakgrund
    0.84
    antaranya
    0.81
     consultato
    0.78
     maș
    0.78
     myö
    0.76
     häls
    0.74
     térmico
    0.74
     vulga
    0.73
    Act Density 0.062%

    No Known Activations