INDEX
    Explanations

    the most important thing

    Negative Logits
    " Затем"        0.87
    "Then"          0.76
    " Depuis"       0.75
    " затем"        0.74
    "Since"         0.72
    " Then"         0.72
    " Since"        0.70
    " занимает"     0.70
    " Became"       0.70
    "лов"           0.69

    Positive Logits
    " best"         1.41
    " onus"         1.25
    " only"         1.23
    " greatest"     1.20
    " key"          1.15
    " ideal"        1.11
    " healthiest"   1.10
    " mere"         1.05
    " danger"       1.02
    " absence"      1.02

    Act Density 0.068%

    No Known Activations