Explanations
references to competitive events or contests
Negative Logits
bedankt     -0.54
Dziękuję    -0.52
ardoor      -0.48
GOTREF      -0.48
fara        -0.48
mphony      -0.47
カンド      -0.46
thanked     -0.46
Спасибо     -0.46
спасибо     -0.45
Positive Logits
competition     1.45
challenge       1.38
competitions    1.33
challenges      1.24
competition     1.20
contest         1.19
contests        1.17
Challenge       1.16
challenge       1.15
Competition     1.13
Activations Density 0.376%