INDEX
    Explanations

    placeholders for signatures and dates

    New Auto-Interp
    Negative Logits
    &-
    0.41
    ких
    0.41
    Likewise
    0.40
    --
    0.39
    lessly
    0.39
     resist
    0.38
    ]|
    0.38
    =:
    0.38
    ियंस
    0.37
     говорил
    0.37
    POSITIVE LOGITS
    _______________
    0.55
     ________
    0.53
     ____________
    0.53
    _________
    0.53
     _______
    0.50
     ____
    0.50
    ________
    0.46
     ______
    0.45
     _____________
    0.45
     __________
    0.45
    Act Density 0.000%

    No Known Activations