INDEX
Explanations
phrases related to comparisons and uncertainties
phrases indicating the essence or identity of a subject
New Auto-Interp
Negative Logits
Roll
-0.63
reflective
-0.59
juggling
-0.55
icipated
-0.54
Hands
-0.50
buildup
-0.50
Direct
-0.50
Ware
-0.49
hostage
-0.49
particip
-0.49
POSITIVE LOGITS
yes
1.01
YES
0.99
YES
0.93
Nope
0.90
yes
0.89
Yes
0.88
laughs
0.81
anyway
0.80
answer
0.80
Yes
0.79
Activations Density 1.506%