INDEX
Explanations
phrases that involve giving credit or recognition
Negative Logits
เลข        -0.16
臺         -0.16
817        -0.15
ederland   -0.15
contend    -0.14
icher      -0.14
bum        -0.14
瓜         -0.14
ampo       -0.14
tü         -0.14
Positive Logits
credit      0.36
Credit      0.30
credit      0.27
Credit      0.26
Bravo       0.24
deserves    0.23
udos        0.22
credits     0.21
admirable   0.21
deserve     0.19
Activations Density 0.231%
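
A minimal sketch of how a density figure like the one above could be computed, assuming it means the fraction of token positions on which the feature fires. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data, made up for illustration: 3 active positions out of 1000.
toy = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under that reading, a density of 0.231% would correspond to the feature firing on roughly 2 or 3 of every 1000 tokens.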