INDEX
    Explanations

    references to the concept of evolution

    New Auto-Interp
    Negative Logits
    +#+#
    -0.81
     masculinos
    -0.71
    nesty
    -0.70
     fraî
    -0.69
     Hopf
    -0.68
     engraçadas
    -0.66
     Jurí
    -0.65
     poitrine
    -0.65
    IntoConstraints
    -0.65
     Gegend
    -0.65
    POSITIVE LOGITS
     evolution
    2.03
     evolve
    1.86
    evolution
    1.83
     Evolution
    1.77
     EVOLUTION
    1.70
    Evolution
    1.65
     evolved
    1.62
     evolves
    1.60
     evolving
    1.59
     evolu
    1.57
    Act Density 0.168%

    No Known Activations