INDEX
Explanations
former roles and past states
New Auto-Interp
Negative Logits
目前
0.53
on
0.52
perspectivas
0.49
Currently
0.46
forged
0.45
т
0.45
Currently
0.45
ুমাত্র
0.45
ไม่มี
0.44
currently
0.44
POSITIVE LOGITS
ehemaligen
0.70
býval
0.61
autrefois
0.59
former
0.58
formerly
0.58
முன்னாள்
0.57
माजी
0.56
преж
0.56
ehemalige
0.56
former
0.54
Activations Density 0.076%