INDEX
    Explanations

    statements emphasizing subjective experiences and personal reflections

    New Auto-Interp
    Negative Logits
    arians
    -0.52
     consciences
    -0.50
    eins
    -0.49
    nesse
    -0.48
    ρους
    -0.47
     Ness
    -0.46
    jutnya
    -0.46
    euvre
    -0.46
    vais
    -0.45
     BOD
    -0.45
    POSITIVE LOGITS
    而是
    0.85
    DeleteBehavior
    0.85
    rungsseite
    0.83
     sondern
    0.82
     بلکه
    0.80
     nor
    0.79
     CreateTagHelper
    0.78
    AndEndTag
    0.73
     melainkan
    0.72
    UnsafeEnabled
    0.69
    Act Density 0.266%

    No Known Activations