INDEX
Explanations
pronouns and verbs indicating personal experience or involvement
references to personal experiences and struggles
New Auto-Interp
Negative Logits
nods
-0.77
endment
-0.76
trustworthy
-0.67
Awesome
-0.66
$$$$
-0.66
ops
-0.65
clicks
-0.65
enary
-0.64
independent
-0.64
Guinness
-0.64
POSITIVE LOGITS
faced
1.28
encountered
1.18
endured
1.15
incurred
1.12
inflict
1.07
inflicted
1.06
encounter
1.06
caused
1.06
suffered
1.04
suffer
1.04
Activations Density 0.193%