INDEX
Explanations
words related to the process of causing harm or damage to something, potentially in various contexts such as military operations, product quality, ecosystem health, and online behavior
terms related to the concept of degradation, particularly in contexts of environmental or status decline
New Auto-Interp
Negative Logits
enegger
-0.89
RH
-0.71
äºĶ
-0.69
ohyd
-0.69
NING
-0.68
olin
-0.67
verning
-0.67
ning
-0.66
atorial
-0.64
rouse
-0.64
POSITIVE LOGITS
degradation
1.20
gradation
1.11
degraded
1.06
degrade
1.02
degrading
0.97
destro
0.87
disadvant
0.76
termination
0.76
graded
0.76
ModLoader
0.75
Activations Density 0.014%