INDEX
    Explanations

    mentions of the word "first" and its variations

    New Auto-Interp
    Negative Logits
     méri
    -0.94
     Theſe
    -0.89
     Magdalene
    -0.88
     decorada
    -0.88
     ApJ
    -0.87
     Schemes
    -0.85
     Scrolls
    -0.85
     équilibr
    -0.85
    LEncoder
    -0.85
     Tales
    -0.84
    POSITIVE LOGITS
     First
    2.01
    FIRST
    1.92
     first
    1.90
    First
    1.89
     FIRST
    1.86
    first
    1.72
     first
    1.40
     getFirst
    1.26
     ersten
    1.25
     rst
    1.25
    Act Density 0.131%

    No Known Activations