INDEX
    Explanations

    a mix of monetary references, dates, and common prefixes/suffixes in words.

    New Auto-Interp
    Negative Logits
    #![
    -0.37
     pym
    -0.36
    lwz
    -0.35
    Marius
    -0.35
    +:+
    -0.35
     mips
    -0.35
     USART
    -0.35
     nero
    -0.34
     gql
    -0.34
     communis
    -0.33
    POSITIVE LOGITS
     hâte
    0.84
     démission
    0.84
     dépens
    0.83
     dégâts
    0.79
     rêves
    0.78
     écl
    0.78
     égard
    0.77
     deuil
    0.74
     autorité
    0.74
     larmes
    0.73
    Act Density 19.347%

    No Known Activations