INDEX
    Explanations

    terms related to principles and accountability in various contexts

    New Auto-Interp
    Negative Logits
     desmotivaciones
    -0.66
     Monfieur
    -0.65
     المعيارى
    -0.64
     defaultstate
    -0.62
     @"/
    -0.60
    +#+#
    -0.60
    aarrggbb
    -0.58
    Билгалдахарш
    -0.57
     שוליים
    -0.57
     víctimas
    -0.57
    POSITIVE LOGITS
     Principal
    1.66
     principal
    1.62
    PRINCIP
    1.61
     principle
    1.59
     PRINCIP
    1.58
     princip
    1.55
     Principle
    1.54
    Princip
    1.50
     Princip
    1.48
    princip
    1.48
    Act Density 0.506%

    No Known Activations