INDEX
    Explanations

    references to choices or conditions related to possibility and preference

    Negative Logits
    Token               Logit
    "TestBed"           -0.58
    "SerializedName"    -0.54
    "ennung"            -0.53
    " Reap"             -0.52
    "rouvez"            -0.51
    " postIndex"        -0.51
    "]--;"              -0.49
    " pprint"           -0.47
    " Geste"            -0.47
    " Validators"       -0.47
    Positive Logits
    Token               Logit
    " not"              0.80
    " whether"          0.79
    "それとも"          0.71
    " нет"              0.67
    "whether"           0.67
    " humaines"         0.66
    " خیر"              0.65
    "not"               0.64
    "otherwise"         0.58
    " otherwise"        0.58
    Activation Density 0.153%
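To give a sense of scale for the density figure, here is a minimal sketch of one way it could be computed. The page does not say how density is measured, so this assumes it is the fraction of token positions where the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce the 0.153% figure.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires;
    the source page does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 153 of which activate the feature.
sample = [0.0] * 99_847 + [1.2] * 153
density = activation_density(sample)
print(f"{density:.3%}")  # 0.153%
```

Under this assumption, a density of 0.153% corresponds to the feature firing on roughly 1.5 out of every 1,000 tokens.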

    No Known Activations