INDEX
Explanations
constructive feedback or criticism
New Auto-Interp
Negative Logits
plicht
0.41
ื่อง
0.39
مظ
0.38
declare
0.37
toire
0.36
調べ
0.36
宣言
0.36
symmetrically
0.36
囤
0.35
ोसिएशन
0.35
POSITIVE LOGITS
feedback
3.27
Feedback
2.98
Feedback
2.95
feedback
2.92
反馈
2.69
feedbacks
2.41
constructive
1.91
critique
1.75
critiques
1.69
criticism
1.66
Activations Density 0.118%