INDEX
Explanations
abbreviations and initialisms related to computing or technology
New Auto-Interp
Head Attr Weights
0:0.03
1:0.03
2:0.05
3:0.04
4:0.04
5:0.03
6:0.47
7:0.03
8:0.03
9:0.04
10:0.09
11:0.07
Negative Logits
Croatian
-1.49
Finnish
-1.41
Norwegian
-1.36
velt
-1.33
Nikol
-1.28
Danish
-1.20
terror
-1.19
Swedish
-1.18
Serbian
-1.18
Benz
-1.17
POSITIVE LOGITS
wcs
1.95
illin
1.59
nown
1.58
ciating
1.51
arro
1.49
γ
1.45
ilater
1.44
atters
1.37
TAG
1.35
nces
1.35
Activations Density 0.011%