INDEX
    Explanations

    locations, such as countries and cities

    New Auto-Interp
    Negative Logits
    <bos>
    -2.57
    -0.87
     serve
    -0.69
    
    
    -0.66
    /***
    
    -0.63
    lateinit
    -0.61
    /**
    -0.61
    ždý
    -0.60
    Kontrola
    -0.59
     apply
    -0.59
    POSITIVE LOGITS
     Juf
    1.69
     bordeaux
    1.63
     maneu
    1.51
     Cæ
    1.50
     carrefour
    1.48
     marseille
    1.48
     emphat
    1.47
     Præ
    1.47
     wien
    1.47
     eiffel
    1.44
    Act Density 1.253%

    No Known Activations