INDEX
    Explanations

    patterns that start with special characters and involve a sense or perception

    special characters or glyphs that may indicate a certain tone or emphasis in the text

    Negative Logits
    Token        Logit
     disse       -0.79
     seiz        -0.79
    ãĥ¯ãĥ³       -0.76
     snail       -0.71
     obser       -0.71
     Franch      -0.70
     scor        -0.68
     hemor       -0.65
     dehuman     -0.64
    icit         -0.63
    Positive Logits
    Token        Logit
    Ŀ            1.58
    ¡            1.19
    Ĵ            1.01
    ľ            0.98
    ī            0.98
    ¤            0.97
    Ĩ            0.95
    Ķ            0.94
    ¦            0.94
    ĺ            0.92
    Act Density 0.299%

    No Known Activations