INDEX
    Explanations

    the beginning of new sections or topics in a text

    New Auto-Interp
    Negative Logits
    <bos>
    -1.06
     estadounid
    -0.44
     queſta
    -0.39
     indígen
    -0.37
     ſei
    -0.37
    ρυ
    -0.37
     increí
    -0.36
    úb
    -0.36
     mauva
    -0.35
    ########.
    -0.35
    POSITIVE LOGITS
    ^(@)
    0.35
    drawal
    0.34
    ficulty
    0.32
    csolódó
    0.31
    >\<^
    0.30
    uests
    0.29
    ideration
    0.29
     [],
    
    0.29
    ignty
    0.29
    %\]
    0.28
    Act Density 42.632%

    No Known Activations