INDEX
Explanations
short verb phrases or clauses indicating action or change of action
instances of direct speech or statements within the text
New Auto-Interp
Negative Logits
.","
-0.84
ÂŃ
-0.80
biod
-0.71
âμ
-0.71
Mehran
-0.70
alties
-0.66
paddle
-0.65
transform
-0.64
encomp
-0.64
locally
-0.64
POSITIVE LOGITS
resa
1.05
anon
1.00
Conclusion
0.94
laughter
0.89
odore
0.87
nah
0.87
Answer
0.84
reply
0.84
mosp
0.84
HAHAHAHA
0.83
Activations Density 0.515%