INDEX
Explanations
mentions of environmental impact or consciousness
terms related to environmental, social, and moral issues
Negative Logits
Klu          -0.72
ials         -0.68
Flavoring    -0.67
lag          -0.66
cess         -0.64
mechanisms   -0.64
ger          -0.63
pains        -0.61
Isle         -0.61
Gifts        -0.60
Positive Logits
tuned         0.89
ically        0.82
regenerate    0.81
UNCH          0.80
mature        0.79
calibrated    0.77
detonated     0.77
bane          0.76
guided        0.76
satisfying    0.76
Activations Density: 0.024%