INDEX
    Explanations

    expressions of emotional responses or reactions to experiences

    past tense evaluations of quality

    New Auto-Interp
    Negative Logits
    fjspx
    -0.45
    Autowired
    -0.40
    timi
    -0.39
    zeiro
    -0.37
    anillo
    -0.36
     miroir
    -0.36
     છે
    -0.34
    kia
    -0.34
     وي
    -0.34
    ts
    -0.33
    POSITIVE LOGITS
     was
    0.84
     было
    0.71
     było
    0.68
     wasnt
    0.65
     était
    0.65
     wasn
    0.64
     была
    0.61
    وكان
    0.61
     הייתה
    0.60
     była
    0.58
    Act Density 0.242%

    No Known Activations