INDEX
Explanations
instances of playful or humorous interactions and moments
New Auto-Interp
Negative Logits
InputBorder
-0.55
arise
-0.53
involve
-0.53
volves
-0.52
EClass
-0.50
occur
-0.49
+:+
-0.49
tedly
-0.48
حات
-0.47
onnay
-0.46
POSITIVE LOGITS
took
1.71
went
1.62
got
1.59
walked
1.54
gave
1.51
wrote
1.50
drove
1.49
tried
1.48
waited
1.45
drank
1.44
Activations Density 0.780%