INDEX
    Explanations

    forms of verbs and their derivations, particularly focusing on past tense and gerunds

    Negative Logits
    lett
    -0.15
    ware
    -0.15
    inç
    -0.14
     Všech
    -0.14
    UTE
    -0.14
    ủi
    -0.14
    誌
    -0.14
    slu
    -0.14
    unya
    -0.13
    ก
    -0.13
    Positive Logits
    tas
    0.15
     Hol
    0.15
    orious
    0.15
    uras
    0.14
    щие
    0.14
    زيد
    0.13
    aan
    0.13
     Rob
    0.13
     sum
    0.13
    edian
    0.13
    Act Density 0.629%

    No Known Activations