INDEX
Explanations
references to emotions, particularly focusing on positive experiences and their effects
New Auto-Interp
Negative Logits
CreateTagHelper
-0.67
BorderStyle
-0.53
الرياضيه
-0.53
|}{}-0.52
LOWER
-0.51
SearchView
-0.51
URG
-0.51
lioz
-0.51
)=-\
-0.50
rawl
-0.50
POSITIVE LOGITS
positive
0.73
(+)
0.69
positivo
0.65
praising
0.64
Positive
0.64
RTCK
0.62
positiva
0.62
praises
0.60
positif
0.59
praise
0.59
Activations Density 0.991%