INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     currencies
    -0.07
     clothes
    -0.07
    לה
    -0.07
     nombres
    -0.07
     clothing
    -0.07
     vegetables
    -0.07
    城乡居民
    -0.07
    -0.07
     princes
    -0.07
    母亲
    -0.07
    POSITIVE LOGITS
     günc
    0.07
    _owner
    0.06
     frau
    0.06
    unfold
    0.06
    🔷
    0.06
    		     
    0.06
     democrat
    0.06
    },${
    0.06
     waypoints
    0.06
     outro
    0.06
    Act Density 0.004%

    No Known Activations