    Negative Logits
    Token          Logit
    "enderror"     -0.95
    "Życiorys"     -0.94
    " pitié"       -0.81
    "sonaro"       -0.79
    " sauvages"    -0.76
    " Głów"        -0.74
    " Nagpur"      -0.73
    " sauvage"     -0.73
    " Svensk"      -0.71
    "endphp"       -0.71
    Positive Logits
    Token          Logit
    " bridge"      2.81
    " bridges"     2.80
    " Bridge"      2.65
    "Bridge"       2.46
    " BRIDGE"      2.42
    " Bridges"     2.41
    "bridge"       2.40
    "bridges"      2.28
    "Bridges"      2.27
    " BRID"        2.02
    Activation Density: 0.040%

    No Known Activations