INDEX
Explanations
phrases related to growth, development, and change
Negative Logits
ãĥīãĥ©    -0.77
hal       -0.74
mates     -0.73
pper      -0.72
puted     -0.71
tta       -0.70
sol       -0.68
ij士      -0.65
oops      -0.63
ppers     -0.63

Positive Logits
exponentially    1.40
steadily         1.05
momentum         0.99
explos           0.97
enormously       0.97
tremend          0.96
rapidly          0.95
dramatically     0.95
tremendously     0.93
awareness        0.92
Activations Density: 4.376%