Explanations
verbs or verb phrases related to hypothetical scenarios
instances of the verb "had" and its contextual implications
Negative Logits
intens       -0.61
redef        -0.59
brink        -0.58
unveiling    -0.57
parody       -0.56
hammer       -0.53
Extreme      -0.53
hilarious    -0.53
reprint      -0.52
pinnacle     -0.52
Positive Logits
been          1.26
gotten        1.14
been          1.11
stayed        1.01
behaved       0.99
intervened    0.99
waited        0.95
gotten        0.95
existed       0.94
gone          0.93
Activations Density: 0.142%