INDEX
    Explanations

    references to social connections and relationships

    Negative Logits
    Token            Logit
    "<bos>"          -2.02
    "/***\n"         -0.64
    " neutralize"    -0.52
    " enshr"         -0.49
                     -0.49
    " knelt"         -0.48
    " minimise"      -0.48
    "Agua"           -0.48
    " mobilize"      -0.48
    " modulate"      -0.47

    Positive Logits
    Token            Logit
    " jetta"          1.02
    " riva"           1.02
    " Minang"         0.99
    " Græ"            0.99
    " lele"           0.97
    " sentra"         0.91
    " Palembang"      0.90
    " Meksi"          0.89
    " brune"          0.89
    " croce"          0.89
    Activation Density 0.691%

    No Known Activations