INDEX
    Explanations

    organizations, event names, and contacts in text

    New Auto-Interp
    Negative Logits
    <bos>
    -1.88
     intersper
    -1.16
    /**
    -0.87
    
    
    -0.85
     endow
    -0.83
     impelled
    -0.82
    <?
    -0.81
     overcrow
    -0.81
     underval
    -0.80
     disbur
    -0.78
    POSITIVE LOGITS
     cioc
    0.67
    Sklici
    0.65
    ihnachts
    0.64
     zub
    0.63
    erenc
    0.63
     tristes
    0.62
     Politica
    0.61
    ihnachten
    0.60
    zsef
    0.60
    acheco
    0.59
    Act Density 1.109%

    No Known Activations