INDEX
    Explanations

    IP addresses

    Negative Logits
    " unanim"  -0.07
    "💔"  -0.07
    [blank]  -0.07
    "เฉพาะ"  -0.07
    "Jamie"  -0.07
    "חר"  -0.06
    "工信"  -0.06
    "免税"  -0.06
    " Too"  -0.06
    "汽车产业"  -0.06
    Positive Logits
    " Кор"  0.07
    "scaled"  0.07
    "});↵↵"  0.07
    "=my"  0.07
    "categoria"  0.07
    " worldly"  0.07
    " writings"  0.07
    " Scenes"  0.06
    " görüşme"  0.06
    " Shan"  0.06
    Activation Density: 0.006%

    No Known Activations