INDEX
Explanations
themes of personal connection and emotional expression
Head Attr Weights
0:0.10
1:0.02
2:0.05
3:0.15
4:0.05
5:0.07
6:0.03
7:0.03
8:0.06
9:0.16
10:0.17
11:0.06
Negative Logits
analysed:-1.41
labelled:-1.39
¶:-1.30
purportedly:-1.27
Created:-1.27
algorith:-1.27
allegedly:-1.23
dominated:-1.23
labeled:-1.20
comprised:-1.20
Positive Logits
morrow:1.56
somew:1.42
?":1.36
tomorrow:1.33
bye:1.31
?':1.29
somebody:1.29
trave:1.24
bye:1.23
IENCE:1.19
Activations Density 0.680%