INDEX
    Explanations

    features, blogs, timer, channels

    New Auto-Interp
    Negative Logits
     integers
    0.46
     imposed
    0.42
     defined
    0.42
     necessary
    0.41
     needless
    0.41
     emphasized
    0.41
     ceased
    0.40
     inflicted
    0.40
     priorities
    0.39
     became
    0.39
    POSITIVE LOGITS
     можно
    0.60
    会有
    0.57
     يمكنك
    0.56
    會有
    0.56
    你可以
    0.55
     можна
    0.54
    에서도
    0.54
     поможет
    0.54
     можете
    0.53
     puedes
    0.52
    Act Density 0.303%

    No Known Activations