INDEX
Explanations
the presence of the phrase "trying to"
phrases that convey attempts or intentions
Negative Logits
Token       Logit
most        -0.68
Clouds      -0.63
Rising      -0.62
Returning   -0.61
Appears     -0.60
Cros        -0.59
Houses      -0.59
ILY         -0.57
Vol         -0.56
Lag         -0.56
Positive Logits
Token       Logit
recreate    1.23
emulate     1.21
convince    1.16
imitate     1.15
revive      1.14
replicate   1.12
conserve    1.11
reconcile   1.10
establish   1.06
eliminate   1.06
Activation Density: 0.075%