INDEX
Explanations
acronyms and abbreviations related to organizations and institutions
Head Attr Weights
0:0.08
1:0.02
2:0.12
3:0.09
4:0.05
5:0.11
6:0.04
7:0.04
8:0.08
9:0.13
10:0.13
11:0.04
Negative Logits
adversity    -1.20
Increase     -1.14
=-=-=-=-     -1.08
strangers    -1.03
majesty      -1.02
Stranger     -1.01
guiName      -0.99
eryl         -0.98
mov          -0.97
gimm         -0.95
Positive Logits
):           1.47
)—           1.46
emale        1.43
),           1.35
+)           1.32
UGC          1.30
)--          1.25
).[          1.24
)            1.23
sonian       1.21
Activations Density 0.084%