INDEX
Explanations
potential actions and abilities
New Auto-Interp
Negative Logits
このように
0.82
ಒಂದು
0.81
它
0.77
ඔහුගේ
0.77
било
0.77
ഒരു
0.76
което
0.75
സിന്റെ
0.75
അതിന്റെ
0.73
grafico
0.73
POSITIVE LOGITS
themselves
1.33
whom
1.14
whom
0.94
willing
0.92
who
0.92
reputations
0.92
sympathize
0.84
जिनके
0.84
quienes
0.83
salaries
0.83
Activations Density 0.045%