INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     will
    -1.77
    .
    -1.32
    as
    -1.31
     sottile
    -1.27
    We
    -1.24
    $.
    -1.23
    It
    -1.21
     These
    -1.20
    mathrm
    -1.20
     mètres
    -1.19
    POSITIVE LOGITS
    𐀀
    1.66
     francia
    1.45
    цыі
    1.44
    katapos
    1.43
     procé
    1.38
     glimmer
    1.37
    zten
    1.34
    どのような
    1.33
    mvh
    1.30
    1.30
    Act Density 0.002%

    No Known Activations