INDEX
Explanations
references to individuals and their roles within a narrative context
Negative Logits
atak       -0.15
çĻ         -0.15
ÑĢа        -0.15
åį·        -0.14
icios      -0.14
Podesta    -0.14
antha      -0.14
ÑĪов       -0.14
igham      -0.14
andon      -0.14
Positive Logits
Bram       0.27
Pi         0.26
Bart       0.23
Fem        0.23
Ru         0.23
Harm       0.23
Rut        0.23
Ton        0.23
Bert       0.23
Bas        0.23
Activations Density: 0.017%