INDEX
Explanations
mentions of attitudes or beliefs
discussions and mentions surrounding the concept of "attitude."
New Auto-Interp
Negative Logits
enegger
-0.96
icles
-0.87
oval
-0.76
icle
-0.72
amen
-0.72
idden
-0.71
dry
-0.67
ichen
-0.67
eri
-0.67
anz
-0.66
POSITIVE LOGITS
toward
1.16
attitude
1.16
towards
1.13
Towards
0.93
attitudes
0.91
Tow
0.77
ysis
0.73
stance
0.72
uation
0.72
disposition
0.70
Activations Density 0.036%