INDEX
Explanations
phrases indicating a transition or change of some sort
phrases that indicate transformation or progression towards a new state
New Auto-Interp
Negative Logits
ailability
-0.72
©¶æ
-0.65
rouse
-0.65
blinded
-0.63
trained
-0.62
cowork
-0.62
Canterbury
-0.61
towed
-0.60
forefront
-0.60
Buckingham
-0.60
POSITIVE LOGITS
thood
0.75
ãģ¦
0.71
ENCE
0.71
actionDate
0.70
encia
0.70
ptive
0.69
bert
0.67
earing
0.67
tons
0.67
ptoms
0.66
Activations Density 0.041%