INDEX
    Explanations

    the word "se" with various endings

    the word "see" in various contexts

    New Auto-Interp
    Negative Logits
    initely
    -0.80
    INGTON
    -0.77
    ashtra
    -0.73
    enegger
    -0.73
    enhagen
    -0.72
     hoops
    -0.72
    SHIP
    -0.69
    £ı
    -0.69
    eanor
    -0.68
    etheless
    -0.68
    POSITIVE LOGITS
    eps
    1.04
    vel
    0.96
    perate
    0.95
    ve
    0.94
    leanor
    0.92
    xt
    0.91
    wed
    0.91
    rend
    0.90
    eker
    0.90
    vent
    0.89
    Act Density 0.012%

    No Known Activations