INDEX
Explanations
terms related to development, education, and organizational structures
New Auto-Interp
Head Attr Weights
0:0.04
1:0.01
2:0.05
3:0.17
4:0.02
5:0.08
6:0.02
7:0.11
8:0.04
9:0.05
10:0.23
11:0.11
Negative Logits
uba
-0.84
ucl
-0.83
ornings
-0.82
�
-0.80
unte
-0.76
imon
-0.75
wic
-0.75
anta
-0.74
cpp
-0.74
icy
-0.73
POSITIVE LOGITS
liest
1.17
iest
1.04
forts
1.01
hest
1.00
osphere
0.96
portion
0.95
bandwagon
0.91
fallacy
0.91
fraternity
0.86
Mysteries
0.83
Activations Density 2.170%