INDEX
    Explanations

    occurences of the word "four."

    New Auto-Interp
    Negative Logits
    ZL
    -0.63
    elis
    -0.55
    stdlib
    -0.55
     Estimation
    -0.55
    ivity
    -0.54
    genesis
    -0.52
    emia
    -0.52
    gentes
    -0.52
    belline
    -0.52
    zine
    -0.51
    POSITIVE LOGITS
     four
    1.13
     FOUR
    0.98
    four
    0.97
     Four
    0.91
    Four
    0.88
     quatre
    0.83
     quatro
    0.81
     dört
    0.81
    FOUR
    0.79
     vier
    0.79
    Act Density 0.016%

    No Known Activations