INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    0.39
     thorns
    0.39
     syphilis
    0.39
    🐛
    0.38
     abrog
    0.38
    য়াত
    0.38
     गवर्न
    0.38
    0.38
    🐏
    0.37
     ech
    0.36
    POSITIVE LOGITS
     kitchen
    2.16
     Kitchen
    1.97
    kitchen
    1.95
    Kitchen
    1.94
     countertops
    1.84
     countertop
    1.77
     kitchens
    1.76
     किचन
    1.66
    キッチン
    1.65
     кух
    1.63
    Act Density 0.035%

    No Known Activations