Explanations
verbs related to causes or effects
phrases that indicate causation or outcomes
Negative Logits
Token               Logit
anooga              -0.90
fortun              -0.68
jon                 -0.61
quickShipAvailable  -0.61
oret                -0.59
Compass             -0.58
Kara                -0.57
FX                  -0.57
resy                -0.56
Telegram            -0.56
Positive Logits
Token               Logit
stumble             1.05
pursue              1.02
contemplate         0.95
ponder              0.93
indulge             0.92
rethink             0.91
engage              0.90
realize             0.90
retire              0.90
othy                0.90
Activations Density: 0.105%