Explanations
roles and titles related to positions of authority or employment
Negative Logits
Token             Logit
featureID         -0.62
disambiguazione   -0.57
ThroughAttribute  -0.54
ContentAlignment  -0.48
WindowEvent       -0.47
scand             -0.47
UnknownFields     -0.47
Klon              -0.46
remp              -0.46
tama              -0.46
Positive Logits
Token             Logit
RectangleBorder   0.74
devenir           0.73
KURZBESCHREIBUNG  0.71
endphp            0.70
head              0.70
diventare         0.70
soyez             0.67
Menjadi           0.66
acting            0.65
一名              0.65
Activations Density: 0.222%