INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     maatregelen
    -0.08
    	results
    -0.08
     tiltak
    -0.08
    acceptable
    -0.08
     synonyms
    -0.08
     resultaten
    -0.08
     prognosis
    -0.08
     kia
    -0.08
     enhancements
    -0.07
    ','=','
    -0.07
    POSITIVE LOGITS
     Che
    0.08
    Tar
    0.07
     unsuccess
    0.07
     Herr
    0.07
     stuff
    0.07
     Freed
    0.07
     peacefully
    0.07
    0.07
    Che
    0.07
     Cheer
    0.07
    Act Density 0.029%

    No Known Activations