Explanations
references to specific characters and events in a narrative context
Negative Logits
flat         -0.37
beginnetje   -0.33
udaler       -0.32
Flat         -0.32
jardim       -0.31
flat         -0.30
Kariera      -0.29
joined       -0.29
radio        -0.28
Auth         -0.28
Positive Logits
Capcom       0.79
MLLoader     0.77
OGND         0.65
ChrTalk      0.65
transQ       0.60
Efq          0.60
Verſ         0.58
himſelf      0.57
незавершена  0.56
tanleria     0.56
Activations Density: 0.005%