INDEX
    Explanations

    references to various sports, particularly baseball and basketball

    New Auto-Interp
    Negative Logits
     football
    -0.18
     baseball
    -0.18
     basketball
    -0.17
     rugby
    -0.17
     Baseball
    -0.17
     hockey
    -0.17
     tennis
    -0.16
    äd
    -0.16
     Hockey
    -0.15
    adele
    -0.15
    POSITIVE LOGITS
    /base
    0.23
    -playing
    0.22
    -reference
    0.19
    /Base
    0.19
    bum
    0.16
    λε
    0.16
    åĵ¡
    0.16
    /music
    0.15
    -related
    0.15
    -loving
    0.15
    Act Density 0.054%

    No Known Activations