Explanations
words related to professions or roles
words related to professions and roles
Negative Logits
guiActiveUn   -0.73
gol           -0.62
Obj           -0.61
DeL           -0.60
COUR          -0.60
defect        -0.59
inev          -0.59
Tuc           -0.59
é¾įå          -0.58
Belg          -0.58
Positive Logits
ners          0.84
rina          0.78
ums           0.78
eper          0.77
ira           0.76
lla           0.75
urs           0.75
ft            0.74
e             0.74
aires         0.74
Activations Density 0.138%
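
The density figure above is commonly read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly greater than zero, and the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical, not taken from the tool that produced this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its value is strictly
    greater than `threshold` (0.0 by default).
    """
    if len(activations) == 0:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, with the feature active on 2.
sample = [0.0] * 998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.200%
```

Under this reading, a density of 0.138% would correspond to roughly 1.4 active positions per 1,000 tokens.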