INDEX
    Explanations

    words indicating possibility and difficulty

    New Auto-Interp
    Negative Logits
     можна
    0.84
    假如
    0.79
     тех
    0.74
     ermöglichen
    0.74
     можно
    0.74
    బ్యా
    0.71
     bekannt
    0.71
     квадра
    0.71
    $.
    0.69
    我要
    0.69
    POSITIVE LOGITS
     struggled
    1.98
     hesitated
    1.90
     unsure
    1.87
     hesitant
    1.77
     struggles
    1.76
     confused
    1.74
     struggle
    1.72
     struggling
    1.71
     perplexed
    1.68
     puzzled
    1.67
    Act Density 0.260%

    No Known Activations