INDEX
Explanations
words related to darkness or negative contexts
references to darkness or negative connotations associated with the term "dark."
Negative Logits
Token       Logit
cule        -0.67
kson        -0.67
yip         -0.67
agine       -0.66
JUST        -0.62
Canaver     -0.61
Pok         -0.60
available   -0.60
Dur         -0.59
Vital       -0.57
Positive Logits
Token       Logit
ening        1.28
room         1.06
ened         1.06
horse        0.93
rooms        0.92
skinned      0.91
vision       0.89
net          0.88
skies        0.87
haired       0.85
Activations Density: 0.025%