INDEX
Explanations
phrases related to contributions and significant impacts
New Auto-Interp
Negative Logits
ismet
-0.14
ries
-0.14
edom
-0.14
inaire
-0.13
kees
-0.13
rahim
-0.13
ãģ¡ãĤī
-0.13
лÑĸÑĤ
-0.13
omp
-0.13
eref
-0.13
POSITIVE LOGITS
contribution
1.13
contributions
1.10
contrib
0.94
Contribution
0.94
Contributions
0.90
Contrib
0.89
contrib
0.87
contribute
0.84
contributed
0.84
contributing
0.81
Activations Density 0.180%