INDEX
    Explanations

    Abbreviations and names

    New Auto-Interp
    Negative Logits
     ne
    -1.00
    ne
    -1.00
    ness
    -0.93
    su
    -0.90
     NE
    -0.88
    NE
    -0.85
    nes
    -0.84
    nev
    -0.74
    -0.71
     Ne
    -0.69
    POSITIVE LOGITS
     ThemeData
    0.86
     sabbia
    0.83
     fermés
    0.79
     vectorielles
    0.75
     varandra
    0.75
     siguran
    0.75
     vastaan
    0.75
     copertina
    0.74
     HttpNotFound
    0.74
     régl
    0.74
    Act Density 0.225%

    No Known Activations