INDEX
Explanations
phrases indicating a change or transformation
instances of the word "turn" and its variations in various contexts
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.90
andan
-0.61
hess
-0.59
Expansion
-0.57
ority
-0.57
foundation
-0.56
constitu
-0.56
den
-0.55
DAQ
-0.54
bons
-0.54
POSITIVE LOGITS
around
0.92
inward
0.77
Ī
0.77
into
0.76
coat
0.76
around
0.72
tide
0.69
heads
0.68
INTO
0.67
Tide
0.67
Activations Density 0.035%