INDEX
Explanations
phrases and terms related to progression or advancement
New Auto-Interp
Negative Logits
further
-0.22
furthermore
-0.22
Further
-0.17
weiter
-0.17
sko
-0.17
eldorf
-0.15
à¥įयप
-0.15
uld
-0.15
iform
-0.15
Furthermore
-0.14
POSITIVE LOGITS
ing
0.48
ance
0.47
ed
0.40
ado
0.31
most
0.30
-reaching
0.28
ances
0.27
than
0.26
ANCE
0.26
er
0.26
Activations Density 0.034%