INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     biv
    -0.09
     comm
    -0.08
    tryk
    -0.08
     vòng
    -0.08
     leat
    -0.07
     Leader
    -0.07
     Roll
    -0.07
     naye
    -0.07
     Self
    -0.07
     കമ്പ
    -0.07
    POSITIVE LOGITS
    åt
    0.08
    athan
    0.08
    0.07
    akati
    0.07
     san
    0.07
    	cancel
    0.07
     Ferd
    0.07
    vänd
    0.07
    daemon
    0.07
    .start
    0.07
    Act Density 0.010%

    No Known Activations