INDEX
Explanations
phrases encouraging action or decision
phrases suggesting encouragement or urging someone to take action
New Auto-Interp
Negative Logits
verage
-0.70
amiya
-0.68
Tickets
-0.62
cers
-0.61
arrivals
-0.61
ctors
-0.61
cible
-0.59
involved
-0.59
represented
-0.58
åº
-0.58
POSITIVE LOGITS
washer
0.77
raining
0.71
unes
0.70
Ħ¢
0.69
Matrix
0.69
acronym
0.69
chy
0.65
myself
0.65
anyway
0.64
anyways
0.63
Activations Density 0.365%