    Negative Logits
    Token                   Logit
    " фунда"                -0.21
    "alie"                  -0.19
    "amburger"              -0.15
    "GenerationStrategy"    -0.14
    "::__"                  -0.14
    "olet"                  -0.14
    "datum"                 -0.13
    "OCI"                   -0.13
    "oldur"                 -0.13
    "verige"                -0.13
    Positive Logits
    Token        Logit
    " Lenin"     0.24
    " Len"       0.20
    " Soviet"    0.20
    " Party"     0.19
    "Len"        0.19
    " len"       0.18
    " GPU"       0.17
    " Лени"      0.17
    " Stalin"    0.17
    " USSR"      0.17
    Activation Density: 0.051%

    No Known Activations