INDEX
Explanations
phrases or sentences describing actions or states
instances of the verb "was" in various contexts
Negative Logits
Previous      -0.73
Highlights    -0.72
inav          -0.70
Footnote      -0.69
Purchase      -0.65
atives        -0.65
lev           -0.65
Supports      -0.64
ESE           -0.63
entails       -0.62
Positive Logits
nt            1.09
wolves        1.03
gonna         1.02
supposed      0.96
wolf          0.95
hes           0.94
definitely    0.93
able          0.93
going         0.89
destined      0.87
Activations Density 0.449%