INDEX
    Explanations

    phrases encouraging openness and communication

    New Auto-Interp
    Negative Logits
     EconPapers
    -0.95
    -0.82
    SharedDtor
    -0.75
     CWE
    -0.74
     autorytatywna
    -0.73
    verwijspagina
    -0.68
     AssemblyProduct
    -0.67
    raszamy
    -0.67
    nemia
    -0.66
     tartalomajánló
    -0.66
    POSITIVE LOGITS
    Feel
    0.56
     feel
    0.54
     felt
    0.53
    Felt
    0.52
     feels
    0.51
     "}
    0.51
    ".
    
    0.51
    "}}
    0.50
    '}}
    0.50
     ""
    
    0.49
    Act Density 0.122%

    No Known Activations