INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     such
    -2.00
    such
    -1.62
    Such
    -1.52
     SUCH
    -1.37
     Such
    -1.34
     solche
    -1.13
     такие
    -1.09
     sådan
    -1.05
     такой
    -1.05
     solchen
    -1.05
    POSITIVE LOGITS
     CreateTagHelper
    0.59
    ()])
    0.48
     فريبيس
    0.48
     zitten
    0.47
    UMENTO
    0.47
     betweenstory
    0.46
    midos
    0.46
    LLES
    0.45
     Sca
    0.45
     henvisninger
    0.44
    Act Density 0.525%

    No Known Activations