Explanations
repetitive phrases indicating ongoing actions or continuity
Negative Logits
còn           -0.18
ott           -0.18
енз           -0.17
ola           -0.17
akhir         -0.16
lant          -0.15
contin        -0.15
continuing    -0.15
_contin       -0.15
Continuing    -0.15
Positive Logits
efforts       0.20
下去          0.18
along         0.17
lbrace        0.16
to            0.16
ued           0.16
857           0.16
down          0.16
momentum      0.16
875           0.16
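
Byte-level BPE vocabularies store non-ASCII tokens as remapped byte strings, so a token such as 下去 can show up as `ä¸ĭåİ»` in a raw logit list. The sketch below shows one way to recover the readable text. It assumes the model uses the GPT-2 style byte-to-character table, which this page does not confirm; the helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style table mapping each byte value to a printable character.

    Assumption: the tokenizer behind this feature uses this table.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    n = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted into the range starting at U+0100.
            byte_values.append(b)
            code_points.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


def decode_token(token):
    """Map a vocabulary string back to raw bytes, then decode those bytes as UTF-8."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ä¸ĭåİ»"))
```

Tokens made only of printable ASCII, such as `efforts` or `momentum`, pass through unchanged, so the same helper can be applied to a whole list.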
Activations Density 0.038%