INDEX
Explanations
phrases where someone is instructing or advising someone else
the preposition "to" in various contexts
Negative Logits
divides        -0.67
hops           -0.67
widened        -0.66
licenses       -0.65
accelerated    -0.62
netted         -0.61
divisions      -0.61
Provision      -0.60
partitions     -0.60
deductions     -0.59
Positive Logits
wered          1.12
pless          0.99
othy           0.97
asted          0.92
asts           0.89
lling          0.89
ilet           0.88
ller           0.87
ffee           0.85
remind         0.85
Activations Density 0.328%