INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     correctes
    -0.83
    __':
    -0.77
    Revenir
    -0.76
    SOUNDBITE
    -0.74
    __":
    -0.73
    findpost
    -0.73
     (?,
    -0.73
    +#+#
    -0.72
    Datuak
    -0.71
    ="(
    -0.69
    POSITIVE LOGITS
    br
    1.10
    Br
    0.90
     Br
    0.87
     br
    0.83
     Brink
    0.77
     Branson
    0.71
     BR
    0.71
     Brind
    0.71
     Obrador
    0.67
     brine
    0.66
    Act Density 0.045%

    No Known Activations