INDEX
Explanations
the word "attitude" with different contexts and intensities
recurring themes related to attitudes or beliefs
New Auto-Interp
Negative Logits
enegger
-1.01
icles
-0.92
oval
-0.80
icle
-0.76
aneously
-0.74
icular
-0.73
eri
-0.71
ÄŁ
-0.70
iaries
-0.70
lisher
-0.68
POSITIVE LOGITS
attitude
1.15
toward
1.12
towards
1.07
attitudes
0.89
Towards
0.84
Tow
0.76
ysis
0.75
uation
0.73
perv
0.71
indifference
0.71
Activations Density 0.023%