INDEX
Explanations
abbreviations and acronyms related to various organizations or concepts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.06
2:0.11
3:0.09
4:0.02
5:0.02
6:0.16
7:0.11
8:0.15
9:0.06
10:0.09
11:0.07
Negative Logits
<@
-1.03
sip
-0.88
ItemTracker
-0.86
sharp
-0.85
(@
-0.84
limp
-0.84
blunt
-0.84
whim
-0.83
Aram
-0.82
petertodd
-0.80
POSITIVE LOGITS
depending
1.18
respectively
0.99
WOR
0.96
UK
0.90
weather
0.90
Republic
0.89
��
0.89
Kid
0.88
wagen
0.87
inburgh
0.87
Activations Density 0.074%