INDEX
Explanations
references to modern societal issues and challenges
New Auto-Interp
Negative Logits
ansen
-0.07
inya
-0.07
ativ
-0.06
akk
-0.06
tep
-0.06
ENTE
-0.06
anton
-0.06
ills
-0.06
Flo
-0.06
Ø·Ùĩ
-0.06
POSITIVE LOGITS
world
0.11
environment
0.11
society
0.09
times
0.08
ìĦ¸ìĥģ
0.08
environments
0.08
çݯå¢ĥ
0.08
миÑĢе
0.07
landscape
0.07
ortam
0.07
Activations Density 0.026%