INDEX
Explanations
acknowledging user's correct points
New Auto-Interp
Negative Logits
뒹
0.78
かね
0.69
await
0.68
considers
0.65
본격
0.64
বিপর
0.63
爱你
0.63
Await
0.62
ledu
0.62
வெறு
0.62
POSITIVE LOGITS
rightly
1.18
correctly
1.17
ถูกต้อง
1.09
correct
1.02
правильно
1.02
Correct
1.00
deserve
0.98
CORRECT
0.98
Correct
0.96
rightfully
0.94
Activations Density 0.383%