INDEX
Explanations
future action or consequence
Negative Logits
checking  0.46
Estamos  0.46
我想  0.46
পারছি  0.45
ofertas  0.44
내가  0.43
το  0.41
我自己  0.41
제가  0.41
טי  0.41
Positive Logits
likely  1.09
typically  0.95
usually  0.84
likely  0.80
also  0.80
generally  0.75
probably  0.74
обычно  0.74
Likely  0.72
hopefully  0.72
Activations Density 0.043%