INDEX
Explanations
phrases related to environmental impact and pollution
Negative Logits
Token      Logit
orida      -0.17
hardt      -0.17
vod        -0.15
aby        -0.15
견         -0.14
λÏī        -0.14
CID        -0.14
گاب        -0.14
extras     -0.13
draining   -0.13
Positive Logits
Token      Logit
air        0.45
PM         0.37
Air        0.37
Poll       0.34
pollution  0.34
poll       0.33
AQ         0.33
partic     0.33
Poll       0.32
Air        0.31
Activations Density 0.089%