INDEX
    Explanations

    the word "couple" in various contexts

    the word "couple" in various contexts

    New Auto-Interp
    Negative Logits
    schild
    -0.69
    aphael
    -0.68
    insula
    -0.65
    anwhile
    -0.65
    ulhu
    -0.64
    roup
    -0.63
     Directorate
    -0.63
    igmatic
    -0.62
    amaru
    -0.62
    kinson
    -0.62
    POSITIVE LOGITS
     dozen
    1.08
     hundred
    0.93
    dozen
    0.85
    ples
    0.82
     couple
    0.75
    tones
    0.75
     goats
    0.69
    Timeout
    0.68
     thirds
    0.68
    tons
    0.67
    Act Density 0.017%

    No Known Activations