INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Кре
    -0.71
    Hel
    -0.68
     Hel
    -0.68
    ıldığı
    -0.67
     junit
    -0.65
     증
    -0.65
    民国
    -0.64
     سعود
    -0.63
    shadowOpacity
    -0.63
    -0.63
    POSITIVE LOGITS
     needle
    6.34
     needles
    5.44
    needle
    4.88
     Needle
    4.78
    Needle
    4.63
     Needles
    3.94
    3.31
    3.27
     aguja
    2.72
     agujas
    2.22
    Act Density 0.060%

    No Known Activations