INDEX
Explanations
phrases indicating growth, progress, or increase
New Auto-Interp
Negative Logits
egot
-0.16
echan
-0.14
ABS
-0.14
à¤Ĥश
-0.14
odb
-0.13
akedown
-0.13
woke
-0.13
Few
-0.13
retch
-0.13
Fully
-0.13
POSITIVE LOGITS
fast
0.79
fast
0.67
-fast
0.63
Fast
0.60
faster
0.60
FAST
0.59
Fast
0.59
fastest
0.58
.fast
0.56
_fast
0.54
Activations Density 0.324%