INDEX
Explanations
phrases indicating persistence or continuity in a state or condition
Negative Logits
Token     Logit
]();      -0.80
التالي    -0.68
περ       -0.67
aiuto     -0.66
dze       -0.66
cil       -0.61
насељу    -0.60
робнее    -0.59
suz       -0.59
Filler    -0.58
Positive Logits
Token     Logit
remains   2.64
remain    2.55
remains   2.37
remained  2.16
remain    2.15
Remains   2.14
Remain    2.10
Remain    1.80
rimane    1.56
bleibt    1.55
Activations Density: 0.053%
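
The page does not say how the density figure is calculated. A common reading, assumed here, is the fraction of token positions on which the feature fires at all. The sketch below shows that calculation on made-up data; the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and do not come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which are active.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.050%"
```

Under this reading, a density of 0.053% would correspond to the feature firing on roughly 5 of every 10,000 token positions.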