INDEX
Explanations
mentions of gratitude and appreciation towards individuals involved in collaborative projects
New Auto-Interp
Negative Logits
ãĥ¬ãĥĥãĥĪ
-0.15
okt
-0.15
JA
-0.15
.toolbox
-0.14
gate
-0.14
ục
-0.14
ekim
-0.14
illet
-0.14
eniable
-0.14
Buk
-0.14
POSITIVE LOGITS
Mas
0.32
Hide
0.26
Hi
0.25
Minor
0.25
Mas
0.25
Takes
0.23
Hide
0.23
Take
0.23
ÐľÐ°Ñģ
0.23
Nob
0.23
Activations Density 0.056%