INDEX
Explanations
phrases related to challenges or negative aspects
words that pertain to burdensome situations or conditions
Negative Logits
artney       -0.87
avering      -0.84
arching      -0.77
avia         -0.77
IVERS        -0.75
veland       -0.74
orah         -0.72
ependence    -0.72
mbuds        -0.71
igsaw        -0.69
Positive Logits
ly            0.82
ities         0.80
aneously      0.78
NESS          0.74
behaviour     0.74
substances    0.73
lifestyles    0.73
measures      0.72
distractions  0.71
ized          0.71
Activations Density: 0.136%