INDEX
Explanations
references to various national institutes and their respective research areas
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.12
3:0.04
4:0.05
5:0.02
6:0.23
7:0.29
8:0.03
9:0.04
10:0.04
11:0.05
Negative Logits
soDeliveryDate
-1.75
subcommittee
-1.63
pause
-1.54
sidel
-1.52
deadlines
-1.50
outnumbered
-1.47
portion
-1.46
malls
-1.45
ipment
-1.44
loophole
-1.43
POSITIVE LOGITS
Acc
1.59
Ir
1.59
Known
1.55
龍�
1.48
Corruption
1.46
Cure
1.44
Spawn
1.42
Chicken
1.39
ilk
1.37
Principles
1.37
Activations Density 0.008%