INDEX
Explanations
verbs and nouns related to contribution and its effects
New Auto-Interp
Negative Logits
rm
-0.18
arp
-0.16
imeo
-0.15
iminal
-0.15
fold
-0.14
bru
-0.14
/desktop
-0.14
ear
-0.14
leigh
-0.14
retorno
-0.14
POSITIVE LOGITS
endum
0.18
contribution
0.17
-contrib
0.17
contributions
0.16
contrib
0.15
ordin
0.15
ToFile
0.15
ìĦĿ
0.15
rible
0.15
924
0.15
Activations Density 0.042%