INDEX
Explanations
environment and health
references to environmental impact or environmental harm.
Negative Logits
Token        Logit
to           -1.14
for          -1.04
方がいい     -0.99
necedor      -0.94
между        -0.91
ほうがいい   -0.91
utkan        -0.87
oscuros      -0.87
americas     -0.85
kreeg        -0.84
Positive Logits
Token        Logit
by           1.07
特别是       1.05
something    1.05
продъл       1.03
&            1.00
sol          1.00
gart         0.94
럽           0.92
when         0.91
long         0.90
Activations Density 0.041%
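
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the fraction of sampled tokens on which the feature's activation is above zero; that definition, the function name `activation_density`, and the sample data are illustrative assumptions, not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 4 of which activate the feature.
sample = [0.0] * 9996 + [0.8, 1.2, 0.3, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.040%
```

Under this reading, a density of 0.041% would mean the feature fires on roughly 4 of every 10,000 tokens, that is, it is very sparse.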