INDEX
    Explanations

    punctuation marks and special symbols

    Negative Logits
    Token             Logit
    "ftagPool"        -1.16
    " ―――――"          -1.13
    "AndEndTag"       -1.04
    " Réponses"       -1.00
    " utafitiHapana"  -1.00
    "Билгалдахарш"    -0.98
    "awtextra"        -0.97
    " photolibrary"   -0.96
    " doubtnut"       -0.96
    "Autoritní"       -0.94
    Positive Logits
    Token             Logit
    "↵↵"              0.67
    "<eos>"           0.66
    " F"              0.53
    " wa"             0.52
    "inn"             0.52
    " G"              0.50
    " Ba"             0.50
    " N"              0.50
    " am"             0.49
    "zulegen"         0.49
    Activation Density: 0.006%

    No Known Activations