INDEX
Explanations
rewards for contribution or exchange
Negative Logits
Ziel  0.42
সড়  0.42
அறிவியல்  0.42
தொடர்ப  0.40
rowadz  0.40
ীভূত  0.40
Verfahren  0.40
convulsions  0.39
disorders  0.39
উদ্বেগ  0.39
Positive Logits
reward  1.12
rewards  1.02
rewarded  0.94
recompens  0.94
reward  0.87
報酬  0.87
remuneration  0.85
repay  0.83
Reward  0.83
Reward  0.82
Activations Density: 0.108%