Explanations
mentions of organizational structure and event participation
Negative Logits
олов    -0.17
iyas    -0.16
erten   -0.16
olley   -0.14
τρο     -0.14
浦      -0.14
subj    -0.14
517     -0.14
liga    -0.14
jit     -0.14
Positive Logits
ima       0.16
anton     0.15
AreaView  0.14
avier     0.14
jax       0.14
/loose    0.14
vet       0.13
еса       0.13
achen     0.13
HEME      0.13
Activations Density 0.565%