INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ([],
    0.43
     yes
    0.39
     Yes
    0.37
     -=
    0.37
     Timeline
    0.37
    Unified
    0.36
    (**
    0.36
     ==
    0.35
     ?,
    0.35
       
    0.35
    POSITIVE LOGITS
     کرام
    0.48
     красоты
    0.47
    점에
    0.45
     لیا
    0.42
     точка
    0.42
     बिंदु
    0.42
     POINT
    0.42
    Laura
    0.41
    utacji
    0.40
     Laura
    0.40
    Act Density 0.009%

    No Known Activations