INDEX
Explanations
words indicating possibility and difficulty
New Auto-Interp
Negative Logits
можна
0.84
假如
0.79
тех
0.74
ermöglichen
0.74
можно
0.74
బ్యా
0.71
bekannt
0.71
квадра
0.71
$.
0.69
我要
0.69
POSITIVE LOGITS
struggled
1.98
hesitated
1.90
unsure
1.87
hesitant
1.77
struggles
1.76
confused
1.74
struggle
1.72
struggling
1.71
perplexed
1.68
puzzled
1.67
Activations Density 0.260%