INDEX
Explanations
phrases related to hypothetical scenarios and decision-making
conditional phrases or expressions of possibility
New Auto-Interp
Negative Logits
foreseen
-0.64
ZI
-0.63
igma
-0.63
pires
-0.61
Tumblr
-0.60
reality
-0.60
ansas
-0.59
itis
-0.59
engineering
-0.59
trendy
-0.58
POSITIVE LOGITS
be
1.09
also
0.99
derive
0.95
introduce
0.95
arrive
0.94
create
0.93
incorporate
0.92
divert
0.91
propose
0.90
doubtless
0.89
Activations Density 0.295%