INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    laze
    -0.77
    -0.73
     equilibrium
    -0.73
    -0.72
     тю
    -0.71
     IPT
    -0.71
    -0.71
     Ochoa
    -0.71
    bera
    -0.70
     Zep
    -0.70
    POSITIVE LOGITS
     long
    2.44
    long
    1.94
    Long
    1.91
     Long
    1.83
    LONG
    1.57
    1.54
    1.48
     LONG
    1.41
     длин
    1.41
     dlou
    1.40
    Act Density 0.021%

    No Known Activations