INDEX
Explanations
verbs expressing intention or planned actions
Negative Logits
Solitaire       -0.75
acks            -0.66
adj             -0.64
anca            -0.61
EStreamFrame    -0.60
Americ          -0.60
archetype       -0.58
java            -0.58
lc              -0.57
unch            -0.57
Positive Logits
lessly          0.79
fully           0.72
provoking       0.70
phis            0.69
orial           0.69
untarily        0.69
Parenthood      0.68
ller            0.68
eering          0.68
atively         0.68
Activations Density 0.017%