INDEX
Explanations
mentions of people's names and the roles they play
New Auto-Interp
Negative Logits
Allez
-0.38
Hej
-0.38
Réponses
-0.38
Nikita
-0.38
Côte
-0.37
São
-0.36
Minaj
-0.36
tara
-0.36
São
-0.36
zeb
-0.36
POSITIVE LOGITS
propOrder
0.68
Jesus
0.65
SequentialGroup
0.58
Jesus
0.57
Edg
0.57
Abel
0.53
Oscar
0.53
Flav
0.52
Mario
0.52
Edgar
0.51
Activations Density 0.278%