INDEX
    Explanations

    phrases related to sports teams and players

    Negative Logits
    <bos>         -1.52
     intersper    -1.37
     endow        -0.85
     gratify      -0.84
    /***          -0.83
     ascribe      -0.80
     rouse        -0.78
     banish       -0.77
     harmonize    -0.77
     acquaint     -0.76
    Positive Logits
     venuto       0.98
     dimentic     0.76
     rechange     0.75
     riuscito     0.72
     rimasto      0.72
     sentito      0.70
     potuto       0.69
     pymongo      0.68
     chrysler     0.68
     innamor      0.68
    Activation Density: 0.562%

    No Known Activations