INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Whittaker
    -0.54
    Gilla
    -0.46
     Franke
    -0.45
    retudo
    -0.45
     beset
    -0.45
    nemouth
    -0.45
     Fré
    -0.44
     Whelan
    -0.44
    rensen
    -0.43
     Belvedere
    -0.43
    POSITIVE LOGITS
     laws
    1.90
     Laws
    1.81
    Laws
    1.80
     LAWS
    1.56
    laws
    1.51
     leyes
    1.25
     lois
    1.23
     Gesetze
    1.21
     Geset
    1.05
     leggi
    1.05
    Act Density 0.005%

    No Known Activations