INDEX
    Explanations

    numeric values and variable identifiers commonly used in programming or mathematical expressions

    New Auto-Interp
    Negative Logits
    `]
    -0.49
     Y
    -0.49
    -0.48
     class
    -0.47
     company
    -0.47
     sex
    -0.46
    ixx
    -0.46
    yki
    -0.46
     L
    -0.45
    ]}>
    -0.44
    POSITIVE LOGITS
     Jurí
    0.84
     myſelf
    0.82
     Normdatei
    0.79
    ItemBackground
    0.75
     pleaſure
    0.74
     Theſe
    0.74
    IsMutable
    0.74
     himſelf
    0.73
    těte
    0.73
     purpoſe
    0.73
    Act Density 0.823%

    No Known Activations