INDEX
    Explanations

    different languages

    Past tense verbs

    NEGATIVE LOGITS
    Token           Logit
    " would"        -0.85
    "would"         -0.81
    " will"         -0.75
    " Would"        -0.71
    " оригіналу"    -0.71
    " fhould"       -0.71
    " WOULD"        -0.66
    "ShouldBe"      -0.66
    " can"          -0.65
    "exitRule"      -0.65
    POSITIVE LOGITS
    Token           Logit
    " was"          0.95
    " did"          0.94
    " came"         0.91
    " took"         0.90
    " went"         0.87
    " gave"         0.84
    " began"        0.81
    " ended"        0.79
    "Did"           0.77
    "did"           0.76
    Activation Density: 0.499%

    No Known Activations