INDEX
Explanations
personal anecdotes or stories written in first person perspective
phrases expressing personal feelings and opinions
New Auto-Interp
Head Attr Weights
0:0.08
1:0.04
2:0.11
3:0.07
4:0.03
5:0.10
6:0.06
7:0.06
8:0.10
9:0.13
10:0.12
11:0.05
Negative Logits
predec
-1.33
differe
-1.29
reperc
-1.19
detrim
-1.15
paran
-1.14
disadvant
-1.14
foothold
-1.13
ulton
-1.09
aggregation
-1.06
dexter
-1.06
POSITIVE LOGITS
myself
1.28
igans
1.21
lished
1.14
obic
1.11
eur
1.11
aniel
1.10
ل
1.09
////////////////////////////////
1.08
tears
1.07
:=
1.07
Activations Density 0.041%