INDEX
Explanations
instances of the word "attitude" with varying degrees of intensity
mentions of specific attitudes or perspectives
Negative Logits
enegger        -0.89
icles          -0.87
icle           -0.75
dry            -0.71
idden          -0.67
oval           -0.67
baum           -0.66
anz            -0.66
schild         -0.65
Interstitial   -0.65
Positive Logits
toward         1.32
towards        1.28
attitude       1.12
Towards        1.03
attitudes      0.88
Tow            0.82
ysis           0.78
stance         0.76
hostility      0.73
uation         0.73
Activations Density: 0.038%