INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     $?
    -0.06
     vời
    -0.06
     Cross
    -0.06
    *****/↵
    -0.06
     unn
    -0.06
     behalf
    -0.06
    tape
    -0.06
     artık
    -0.06
     Hey
    -0.06
    τικός
    -0.06
    POSITIVE LOGITS
    .protocol
    0.07
    	View
    0.07
    bindung
    0.06
    administration
    0.06
    TRACT
    0.06
     Hoff
    0.06
    ンブ
    0.06
    -transfer
    0.06
    áze
    0.06
     Communist
    0.06
    Act Density 0.004%

    No Known Activations