INDEX
    Explanations

    phrases related to sports and competition

    phrases indicating duration or recurrent events

    Negative Logits
    Token             Logit
    \\\\\\\\          -0.62
     :(               -0.57
     Mub              -0.57
    .�                -0.56
    .")               -0.55
     looph            -0.55
     Adin             -0.54
     [];              -0.54
    Ö¼                -0.54
    eve               -0.54
    Positive Logits
    Token             Logit
     respectively     0.87
     etc              0.82
    ?,                0.80
    etheless          0.69
     remains          0.67
    odan              0.66
     becomes          0.65
    ?),               0.64
     seems            0.63
    thing             0.60
    Act Density 0.934%

    No Known Activations
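
The "Act Density" figure above is not defined on this page. A common reading is the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample values are all hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens with a strictly
    positive activation. The tool behind this page may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical activations for one feature over 1000 tokens:
# 990 tokens where the feature is silent, 10 where it fires.
sample = [0.0] * 990 + [0.3, 1.2, 0.7, 0.05, 2.1, 0.9, 0.4, 1.1, 0.6, 0.2]

print(f"Act Density {activation_density(sample):.3f}%")
```

Under this definition, a density of 0.934% would correspond to the feature firing on roughly 9 of every 1000 sampled tokens.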