INDEX
    Explanations

    mentions of educational institutions and professional roles

    New Auto-Interp
    Negative Logits
    <bos>
    -1.45
    -0.83
     springfox
    -0.72
     naudoti
    -0.71
     įsi
    -0.65
     overcrow
    -0.65
    <?
    -0.62
     nė
    -0.62
    
    
    -0.61
     thicken
    -0.61
    POSITIVE LOGITS
     Mejía
    0.86
     Khart
    0.84
     Minang
    0.82
     Ribera
    0.81
    emmel
    0.80
     Bahía
    0.80
     Meksi
    0.76
     Cár
    0.76
     Mentre
    0.76
     véhic
    0.76
    Act Density 0.404%

    No Known Activations