INDEX
Explanations
expressions of emotional distress and support
New Auto-Interp
Head Attr Weights
0:0.08
1:0.02
2:0.43
3:0.12
4:0.03
5:0.08
6:0.02
7:0.05
8:0.03
9:0.03
10:0.05
11:0.02
Negative Logits
eligible
-2.95
Scholarship
-2.69
eligibility
-2.66
scholarships
-2.64
reuse
-2.62
pedia
-2.59
profits
-2.54
winner
-2.52
specialization
-2.51
Foss
-2.48
POSITIVE LOGITS
Anxiety
5.07
panic
4.78
anxiety
4.76
calmed
4.57
emotion
4.45
emotions
4.44
xiety
4.44
calm
4.43
feeling
4.42
sadness
4.37
Activations Density 1.022%