INDEX
Explanations
favorite topics, passions, or choices
New Auto-Interp
Negative Logits
развития
0.39
bioavailability
0.39
investissement
0.39
調べて
0.39
мани
0.38
Konstantin
0.37
ارتباط
0.37
consequences
0.37
涉及到
0.37
ойнотуу
0.37
POSITIVE LOGITS
preferred
0.92
favorite
0.89
prefer
0.84
prefers
0.84
bevorzug
0.83
preferred
0.79
preference
0.79
favourite
0.78
favorito
0.78
preferences
0.78
Activations Density 0.386%