INDEX
Explanations
quantifying amounts, goals, or changes
New Auto-Interp
Negative Logits
放在
0.45
টাকা
0.42
navigateTo
0.41
łączyć
0.41
प्लेइंग
0.39
డబ్బు
0.39
..”
0.39
ಾಗಿ
0.37
गेशन
0.37
র্
0.36
POSITIVE LOGITS
averaged
0.60
spent
0.55
spends
0.52
averages
0.52
consumes
0.51
averaging
0.50
Spent
0.50
spend
0.46
каж
0.46
average
0.45
Activations Density 0.030%