INDEX
Explanations
references to military actions or involvement
political reporting and social connections
New Auto-Interp
Negative Logits
nahilalakip
-0.80
gyhoeddwyd
-0.53
surla
-0.51
tesettür
-0.50
Administrativna
-0.50
satılık
-0.49
Exacts
-0.48
initComponents
-0.47
TextAppearance
-0.47
Rüyada
-0.47
POSITIVE LOGITS
+#+#
0.51
hen
0.35
transQ
0.34
zvuky
0.34
subpackage
0.34
xnn
0.33
YM
0.33
svc
0.32
fhir
0.32
]-->
0.31
Activations Density 0.024%