INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Thing
    -0.08
     فرو
    -0.07
     měsíce
    -0.07
    ุมภาพ
    -0.07
    ubits
    -0.07
    .push
    -0.07
    266
    -0.07
    Fragment
    -0.06
    ngen
    -0.06
    (thing
    -0.06
    POSITIVE LOGITS
     care
    0.15
     Care
    0.13
    Care
    0.11
    care
    0.10
     CARE
    0.09
     healthcare
    0.09
    -care
    0.08
     caring
    0.08
    imore
    0.08
     caregivers
    0.08
    Act Density 0.040%

    No Known Activations