INDEX
Explanations
phrases indicating formal reviews or assessments
New Auto-Interp
Head Attr Weights
0:0.02
1:0.03
2:0.04
3:0.30
4:0.02
5:0.03
6:0.07
7:0.15
8:0.04
9:0.10
10:0.07
11:0.10
Negative Logits
ulhu
-1.44
imeters
-1.35
���
-1.34
カ
-1.32
����
-1.30
ppa
-1.25
agne
-1.24
inches
-1.20
フォ
-1.18
atos
-1.13
POSITIVE LOGITS
agre
1.49
ommel
1.39
antioxid
1.30
pse
1.30
DragonMagazine
1.27
opin
1.25
essional
1.25
Merge
1.23
dossier
1.21
1.20
Activations Density 0.014%