INDEX
    Explanations

    phrases related to physical activities and achievements

    New Auto-Interp
    Negative Logits
    <bos>
    -4.54
    -1.34
    <?
    -1.22
    
    
    -1.17
    /***
    
    -1.12
    /**
    -1.09
    /*!
    
    -0.97
     intersper
    -0.93
     disbur
    -0.88
     springfox
    -0.87
    POSITIVE LOGITS
     corrom
    0.86
     seksi
    0.86
     tristes
    0.78
     marea
    0.76
     maroc
    0.76
     saar
    0.75
     ceramica
    0.74
     vasi
    0.74
     uhr
    0.74
     kafe
    0.74
    Act Density 1.335%

    No Known Activations