INDEX
    Explanations

    eat, eating, ate, eaten

    New Auto-Interp
    Negative Logits
    0.64
    0.64
    V
    0.61
    0.61
     لاک
    0.60
    م
    0.60
    ک
    0.60
    Ч
    0.59
    Nicol
    0.57
    З
    0.56
    POSITIVE LOGITS
     eat
    0.88
     eating
    0.88
    0.81
     Eat
    0.80
     Eating
    0.79
     eats
    0.78
     makan
    0.75
     Eats
    0.74
    吃了
    0.73
     eaten
    0.72
    Act Density 0.046%

    No Known Activations