INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Efq
    -0.85
     doubtnut
    -0.85
     myſelf
    -0.82
     Jefus
    -0.79
    Reprodução
    -0.78
     Hift
    -0.75
    ſelf
    -0.73
     ſche
    -0.72
     itſelf
    -0.72
     themſelves
    -0.72
    POSITIVE LOGITS
    urlpatterns
    0.52
    tagext
    0.51
    most
    0.49
    tu
    0.48
    ande
    0.48
    matchCondition
    0.47
    se
    0.46
    cu
    0.46
    kom
    0.45
    cal
    0.44
    Act Density 0.447%

    No Known Activations