INDEX
    Explanations

    Swahili names and words

    New Auto-Interp
    Negative Logits
     enth
    0.76
     induces
    0.73
     ec
    0.72
     especific
    0.69
     triggers
    0.69
    జ్య
    0.68
     conjectures
    0.68
     incompatible
    0.68
     plasm
    0.67
     indu
    0.67
    POSITIVE LOGITS
     kwa
    1.22
     hizo
    1.18
     katika
    1.13
     milioni
    1.10
     kwenye
    1.08
     kutoka
    1.02
     maand
    1.00
     hata
    0.99
     kama
    0.98
     programu
    0.98
    Act Density 0.008%

    No Known Activations