    Explanations

    baby and associated terms

    Negative Logits
    Token               Logit
    "the"               1.36
    " powied"           1.30
    "ו"                 1.28
    (not rendered)      1.21
    (not rendered)      1.16
    "ה"                 1.16
    "ל"                 1.16
    " the"              1.15
    " it"               1.11
    (not rendered)      1.08
    Positive Logits
    Token               Logit
    " Baby"             1.21
    "েন"                1.13
    "很多"              1.05
    "ick"               1.04
    "ة"                 1.00
    "y"                 0.99
    "一些"              0.98
    "datei"             0.98
    "stoffen"           0.95
    "婴儿"              0.94
    Activation Density: 0.007%

    No Known Activations