INDEX
    Explanations

    verbal phrases indicating actions or emotions

    expressions of contradiction or hypocrisy in political contexts

    New Auto-Interp
    Negative Logits
     autorytatywna
    -0.49
    enderror
    -0.48
    withIdentifier
    -0.47
    HtmlAttribute
    -0.47
    CppCodeGen
    -0.45
    phazard
    -0.45
    Aholisi
    -0.44
    گران
    -0.44
    Unmarshaller
    -0.44
     AssemblyProduct
    -0.44
    POSITIVE LOGITS
     hasta
    0.97
     till
    0.91
    beyond
    0.88
     até
    0.87
     sampai
    0.87
     extreme
    0.85
     beyond
    0.85
     jusqu
    0.81
    0.81
     max
    0.81
    Act Density 0.140%

    No Known Activations