Explanations
expressions of praise and credit for achievements
Negative Logits
Token        Logit
ома          -0.16
@testable    -0.14
TEE          -0.14
bạc          -0.14
½æķ°         -0.14
.exchange    -0.14
çIJ´         -0.13
ARRIER       -0.13
anco         -0.13
ture         -0.13
Positive Logits
Token        Logit
credit       0.76
Credit       0.63
credit       0.62
Credit       0.59
credits      0.54
props        0.49
Credits      0.46
_credit      0.45
.credit      0.43
Credits      0.41
Activations Density: 0.187%
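
For readers unfamiliar with the density figure, here is a minimal sketch of one common way to compute it, assuming it means the fraction of sampled tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all; the dashboard's exact definition is not stated here.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 3 of 1600 tokens.
sample = [0.0] * 1597 + [0.4, 1.2, 0.7]
density = activation_density(sample)
print(f"density = {density:.6f}")
```

A density this low would indicate a sparse feature that fires on a small share of tokens, which fits a feature tied to a narrow set of words such as "credit" and "props".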