INDEX
Explanations
forms of "is" and "becomes"
Negative Logits
trening     0.45
ajutor      0.42
assurance   0.42
캠          0.41
دانست       0.41
anbieten    0.40
nomen       0.39
вопросы     0.39
nevoie      0.39
と考えて    0.39
Positive Logits
είναι       0.73
blir        0.66
becomes     0.61
bliver      0.61
fung        0.57
ligger      0.57
betyder     0.57
viene       0.54
يكون        0.53
является    0.52
Activations Density 0.001%