INDEX
Explanations
efficiency or frequency analysis
New Auto-Interp
Negative Logits
финансо
0.34
💸
0.28
financeiros
0.28
финансовых
0.28
💰
0.28
🎸
0.28
、
0.28
🚀
0.28
事件
0.27
ఆర్థిక
0.27
POSITIVE LOGITS
it
0.34
its
0.31
vague
0.30
phobic
0.29
implied
0.29
inferior
0.29
ideology
0.29
other
0.29
agama
0.29
preconceived
0.28
Activations Density 0.000%