INDEX
    Explanations

    words and phrases that could be part of formal writing such as legal or journalistic works

    New Auto-Interp
    Negative Logits
    ThroughAttribute
    -0.91
    LookAnd
    -0.57
    !*\
    -0.52
    PreferredItem
    -0.49
    Personendaten
    -0.47
    InstrumentedTest
    -0.44
    клопе
    -0.44
    enschappelijke
    -0.44
    IsContent
    -0.43
     anfangen
    -0.43
    POSITIVE LOGITS
     be
    3.53
    Be
    2.55
    be
    2.53
     Be
    2.50
     BE
    1.93
    BE
    1.64
     soient
    1.62
     sejam
    1.34
     soyez
    1.32
     быть
    1.23
    Act Density 15.059%

    No Known Activations