Explanations
past tense or adjective forms
Negative Logits
ت  1.09
ל  0.87
т  0.73
ת  0.71
י  0.70
с  0.69
ات  0.69
و  0.68
л  0.67
सँग  0.66
Positive Logits
resolved  0.55
SE  0.54
一定的  0.54
Changed  0.49
था  0.47
ays  0.46
Warsz  0.46
{\  0.46
प्रस्तावित  0.46
å  0.45
Activations Density 0.000%