INDEX
Explanations
phrases indicating interest or focus
instances of the phrase "to" that indicate intention or purpose
New Auto-Interp
Negative Logits
arten
-0.69
umb
-0.66
cass
-0.65
refunds
-0.63
freezing
-0.60
exemptions
-0.60
opus
-0.60
signaled
-0.59
awei
-0.59
unn
-0.59
POSITIVE LOGITS
ggles
0.99
behold
0.90
mes
0.85
contemplate
0.84
pper
0.79
ADS
0.77
me
0.75
asts
0.74
ilet
0.74
roe
0.71
Activations Density 0.128%