INDEX
Explanations
instances where something would have occurred or been done
instances of the phrase "would have," indicating hypothetical or conditional scenarios
New Auto-Interp
Negative Logits
Cold
-0.60
plaintiff
-0.57
osi
-0.56
poppy
-0.56
muse
-0.56
narrator
-0.56
seasoning
-0.54
doll
-0.54
Trivia
-0.53
hostage
-0.53
POSITIVE LOGITS
been
1.04
gotten
0.98
been
0.97
ĸļ
0.85
¶
0.84
taken
0.83
Ģ
0.81
gotten
0.80
gone
0.77
Been
0.76
Activations Density 0.067%