INDEX
Explanations
references to academic institutions and scholarly related terms
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.05
3:0.07
4:0.16
5:0.03
6:0.13
7:0.15
8:0.04
9:0.05
10:0.07
11:0.15
Negative Logits
gaard
-1.54
iasco
-1.35
oxide
-1.28
chwitz
-1.28
adow
-1.26
ブ
-1.22
ghan
-1.21
mistakenly
-1.19
シャ
-1.17
ollower
-1.16
POSITIVE LOGITS
bots
1.64
Journals
1.47
Authors
1.41
acad
1.32
Jaw
1.31
indust
1.31
Synd
1.30
Alphabet
1.27
glers
1.27
devices
1.27
Activations Density 0.003%