INDEX
Explanations
positive adjectives related to brightness or light
Negative Logits
GAN              -0.92
mx               -0.91
ombat            -0.86
orset            -0.86
ilage            -0.84
FactoryReloaded  -0.83
AX               -0.83
berus            -0.82
berman           -0.81
utor             -0.81
Positive Logits
lights           1.27
ened             1.24
ening            1.23
eners            1.19
lights           1.17
brightest        1.14
brighter         1.13
bulbs            1.10
lighting         1.07
bright           1.06
Activations Density 1.846%