INDEX
    Explanations

    instances and phrases related to justification or reasoning

    New Auto-Interp
    Negative Logits
     AssemblyVersion
    -0.75
     jLabel
    -0.73
     ModelExpression
    -0.67
    //});
    -0.65
     Monfieur
    -0.64
     Majefty
    -0.63
    ')")
    -0.62
    FlatAppearance
    -0.60
     ſtate
    -0.60
     endforeach
    -0.60
    POSITIVE LOGITS
     sheer
    0.87
     alone
    0.69
     pure
    0.68
     Allein
    0.68
     allein
    0.67
    alone
    0.64
     alleine
    0.61
     alene
    0.59
     sake
    0.58
     test
    0.56
    Act Density 0.297%

    No Known Activations