INDEX
Explanations
preposition 'for' followed by 'you'
New Auto-Interp
Negative Logits
us
1.54
me
1.53
them
1.52
him
1.45
THEM
1.35
мене
1.29
them
1.27
мной
1.26
myself
1.26
comigo
1.25
POSITIVE LOGITS
спублі
0.75
offel
0.75
لوار
0.74
SIBILITY
0.72
ૃતિ
0.69
ามารถ
0.69
রীণ
0.68
眺
0.68
Revenir
0.68
erry
0.68
Activations Density 0.110%