INDEX
    Explanations

    the beginning of new sections or paragraphs in a text

    New Auto-Interp
    Negative Logits
     myself
    -0.53
     yourself
    -0.52
     καν
    -0.51
    athen
    -0.49
    hamshire
    -0.47
     minta
    -0.47
     Anybody
    -0.47
    box
    -0.47
     fast
    -0.45
     Myself
    -0.45
    POSITIVE LOGITS
     receita
    0.69
    extAlignment
    0.64
     الرياضيه
    0.62
     receitas
    0.62
     pylint
    0.61
    Personensuche
    0.61
    ötzlich
    0.60
     inflación
    0.60
    ypress
    0.60
     समीक्षक
    0.59
    Act Density 0.134%

    No Known Activations