INDEX
Explanations
words associated with warmth and comfort
New Auto-Interp
Negative Logits
idal
-0.19
olls
-0.17
ials
-0.16
airs
-0.15
jed
-0.15
ually
-0.15
ippi
-0.14
uen
-0.14
rious
-0.14
ëĿ½
-0.14
POSITIVE LOGITS
-blood
0.26
fuzzy
0.25
est
0.25
weather
0.21
ong
0.21
ingly
0.20
blood
0.20
-hearted
0.20
uzzy
0.20
weather
0.19
Activations Density 0.020%