INDEX
Explanations
team, colleague, employee, work
Negative Logits
deney            0.49
mansions         0.47
museums          0.46
scenery          0.46
gard             0.44
leuc             0.43
entertained      0.42
insults          0.41
marvellous       0.41
philanthropist   0.41
Positive Logits
同事             0.83
colleague        0.71
업무             0.70
colleagues       0.68
员工             0.66
Employees        0.65
Employees        0.64
Employee         0.63
Employee         0.63
employee         0.62
Activations Density 0.316%