INDEX
    Explanations

    expressions of positivity or approval

    Following the word "good"

    Negative Logits
    Token               Logit
    " ilha"             -0.48
    "Ashlee"            -0.44
    " transfer"         -0.43
    "StreamWriter"      -0.42
    " cadeia"           -0.42
    " Wraith"           -0.42
    "transfer"          -0.41
    "PathVariable"      -0.41
    " transferência"    -0.41
    " Isla"             -0.40
    Positive Logits
    Token               Logit
    "Good"              1.27
    " Good"             1.23
    " GOOD"             1.20
    "good"              1.20
    " good"             1.17
    "GOOD"              1.14
    " Хоро"             0.88
    "Хороший"           0.87
    " bonnes"           0.86
    " buen"             0.85
    Activation Density: 0.080%

    No Known Activations