INDEX
Negative Logits
juiste              0.75
wrong               0.72
inappropriate       0.72
正确                0.71
ంశ                  0.70
GQ                  0.70
المنا               0.69
inconsistencies     0.69
正确的              0.69
correct             0.68
Positive Logits
建议                1.09
strongly            1.05
consiglio           1.03
Recommendation      1.02
建議                1.01
recommend           1.00
recomenda           0.98
recommendation      0.98
Recommended         0.98
highly              0.97
Activations Density 0.168%
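
The density figure can be read as the share of token positions on which the feature fires. The sketch below is one way to compute such a number, assuming "fires" means an activation strictly above a threshold of zero. The function name `activation_density`, the threshold, and the sample data are all hypothetical and do not come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its value is strictly
    greater than `threshold`; the dashboard may use a different rule.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 17 of which are active.
sample = [0.0] * 9983 + [1.5] * 17
density = activation_density(sample)
print(f"Activation density: {density:.3%}")
```

A density well below 1% indicates a sparse feature that is active on only a small slice of the sampled text.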