INDEX
Explanations
phrases indicating choice or comparison between options
choice between items
New Auto-Interp
Negative Logits
Église
-0.39
Outro
-0.37
relev
-0.37
dAtA
-0.36
Kaynakça
-0.36
AssemblyCulture
-0.36
Record
-0.36
Funk
-0.36
Fim
-0.35
STM
-0.35
POSITIVE LOGITS
inSlope
0.59
wyboru
0.58
escolha
0.56
RTLR
0.53
escoger
0.53
prefier
0.52
tercih
0.52
setupUi
0.51
either
0.51
createState
0.51
Activations Density 0.017%