INDEX
Explanations
requiring effort, skill, or specific conditions
New Auto-Interp
Negative Logits
'
0.46
’
0.45
структура
0.41
ган
0.40
ג
0.39
ларда
0.38
adecuada
0.38
وا
0.37
метод
0.37
enfermedades
0.37
POSITIVE LOGITS
patience
0.55
collaboration
0.52
investment
0.50
cooperation
0.49
dedication
0.48
用到
0.47
attention
0.45
expertise
0.44
investments
0.44
fluency
0.43
Activations Density 0.013%