INDEX
Explanations
references to community initiatives and government policies
New Auto-Interp
Negative Logits
ìĿ¸ëį°
-0.16
either
-0.16
oris
-0.15
enberg
-0.15
æĹ¢
-0.14
aru
-0.14
geb
-0.14
oran
-0.14
además
-0.14
enville
-0.14
POSITIVE LOGITS
initially
0.18
may
0.17
certainly
0.17
overall
0.16
may
0.16
nok
0.15
SOME
0.15
maz
0.15
somewhat
0.14
technically
0.14
Activations Density 0.224%