INDEX
Explanations
words related to waste management, specifically the word "garbage"
references to waste and garbage
New Auto-Interp
Negative Logits
ym
-0.85
saf
-0.85
akening
-0.84
cies
-0.76
slow
-0.75
imble
-0.74
atern
-0.74
rons
-0.73
sen
-0.72
igmat
-0.70
POSITIVE LOGITS
garbage
1.27
bage
1.11
dumps
1.07
heap
0.99
rubbish
0.94
dump
0.93
bins
0.92
trash
0.90
collector
0.88
cans
0.87
Activations Density 0.007%