INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    akad
    0.37
     কাঠ
    0.36
    лер
    0.36
    環境
    0.36
    ń
    0.35
    kow
    0.34
     strukt
    0.34
     :/
    0.34
    ظيف
    0.34
    \
    0.33
    POSITIVE LOGITS
     direction
    0.60
     intensity
    0.47
     directions
    0.46
    方向
    0.46
    direction
    0.45
     intensities
    0.45
     directional
    0.45
    intensity
    0.44
     magnitude
    0.43
     направление
    0.43
    Act Density 0.000%

    No Known Activations