INDEX
    Explanations

    numerical values or references to dates and scores related to sports or events

    New Auto-Interp
    Negative Logits
    _ATTR
    -0.15
    528
    -0.15
    oldt
    -0.15
    onitor
    -0.14
    è·¡
    -0.14
    avra
    -0.13
    _impl
    -0.13
    IX
    -0.13
    umba
    -0.13
    std
    -0.13
    POSITIVE LOGITS
     derby
    0.18
     disput
    0.18
     ragaz
    0.17
     play
    0.17
     debut
    0.16
     Coupe
    0.16
     marc
    0.16
     matches
    0.15
     tempor
    0.15
     è©
    0.15
    Act Density 0.036%

    No Known Activations