INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    t
    0.91
    H
    0.91
    S
    0.89
    R
    0.89
    N
    0.85
    F
    0.83
    j
    0.80
    p
    0.79
    X
    0.77
    E
    0.75
    POSITIVE LOGITS
     κόσ
    0.75
    istä
    0.73
     to
    0.73
     коли
    0.72
     PIB
    0.68
     μια
    0.66
     کا
    0.65
     penetrating
    0.65
     μία
    0.64
     GTI
    0.64
    Act Density 0.002%

    No Known Activations