INDEX
    Explanations

    references to experiences or evaluations of individuals and their actions

    Negative Logits
     surrounded         -0.52
     angekommen         -0.47
    intios              -0.44
     Wiktionnaire       -0.42
     égard              -0.42
    Derbyniad           -0.41
    DetailComponent     -0.40
    nasium              -0.40
    mulos               -0.40
     naturen            -0.39
    Positive Logits
     lenient            0.79
     ModelExpression    0.78
     generous           0.78
    /**                 0.76
     unhelpful          0.75
     gracious           0.72
    cooperative         0.71
     amables            0.71
     courteous          0.71
    TestingModule       0.70
    Activation Density: 0.324%

    No Known Activations