INDEX
Explanations
references to "green" or environmentally themed elements
Negative Logits
redd      -0.18
tk        -0.17
cz        -0.16
ollipop   -0.15
rell      -0.15
ëĿ½       -0.15
aurus     -0.15
cles      -0.14
erah      -0.14
zej       -0.14
Positive Logits
ery       0.44
ish       0.28
houses    0.28
wich      0.26
peace     0.26
belt      0.26
est       0.24
washing   0.23
leaf      0.23
ERY       0.22
Activations Density: 0.039%