INDEX
Explanations
phrases related to professional or organizational responsibility
Negative Logits
rane     -0.15
und      -0.15
uu       -0.15
ázd      -0.14
oggler   -0.14
zwar     -0.14
acades   -0.13
τή       -0.13
essel    -0.13
but      -0.13
Positive Logits
also     0.19
also     0.17
także    0.17
أيضا     0.17
actual   0.16
equally  0.16
actual   0.16
ones     0.15
también  0.15
Also     0.15
Activations Density 0.233%