INDEX
    Explanations

    foreign languages, punctuation

    New Auto-Interp
    Negative Logits
    Also
    0.40
     также
    0.39
    也可
    0.39
     Также
    0.38
    Также
    0.38
     also
    0.38
     역시
    0.37
    \
    0.37
     також
    0.36
     greatly
    0.36
    POSITIVE LOGITS
     těch
    0.46
     những
    0.44
     habitudes
    0.43
    那些
    0.41
    0.40
     마치
    0.39
     כמו
    0.39
     aquellas
    0.39
     subtleties
    0.39
    就像
    0.38
    Act Density 0.031%

    No Known Activations