Explanations
references to gratitude and support
Negative Logits
oli      -0.16
ubbo     -0.15
सन       -0.14
ç·Ĵ      -0.14
ordo     -0.14
ordin    -0.14
erle     -0.14
íĹ       -0.14
affairs  -0.14
ordion   -0.14
Positive Logits
efforts        0.26
effort         0.24
contribution   0.23
contributions  0.21
help           0.20
continued      0.18
work           0.18
trouble        0.18
oyal           0.17
handling       0.17
Activations Density: 0.058%