    Negative Logits
    "aarrggbb"            -0.54
    " autorytatywna"      -0.53
    "Rptr"                -0.53
    "Biografia"           -0.49
    " AssemblyCompany"    -0.45
    "abestanden"          -0.44
    "jsonwebtoken"        -0.44
    "AxisAlignment"       -0.43
    "woordig"             -0.43
    " Races"              -0.43
    Positive Logits
    " preceding"          0.47
    "Personendaten"       0.47
    " previous"           0.47
    " previo"             0.44
    " vorher"             0.40
    " previos"            0.40
    "before"              0.40
    "tagext"              0.39
    "previous"            0.38
    " Eng"                0.37
    Activation Density: 0.504%

    No Known Activations