INDEX
Explanations
phrases indicating future actions or events
Negative Logits
Token         Logit
alic          -0.17
continu       -0.17
尚            -0.14
subsequently  -0.14
746           -0.14
Contin        -0.14
Scoped        -0.14
continua      -0.14
ύτε           -0.14
zap           -0.14
Positive Logits
Token         Logit
potentially   0.18
orial         0.16
become        0.16
soon          0.16
hopefully     0.16
imminent      0.16
shortly       0.16
очеред        0.16
becoming      0.15
final         0.15
Activations Density: 0.100%